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HORMONE RESPONSE ELEMENT BINDING TRANSREGULATORS 

This application claims benefit of U.S. Provisional Application No. 60/508,763, filed October 3, 2003, 
which is incorporated herein by reference in its entirety. 

I. BACKGROUND OF THE INVENTION 
5 1 . Transcriptional regulation is very important in controlling cell growth. 

Transcription can be regulated by molecules that act as activators of transcription or as 
repressors of transcription. Disclosed are engineered modulators of transcription that are 
based on the covalent linking of regions that bind DNA. These molecules can then have 
repressor domains or activation domains added to modulate transcription. These molecules 
10 can be used to identify genes that are naturally controlled by certain transcription regulators, 
such as the estrogen receptor. Furthermore, they can be used to control transcription in cells 
at specific sites. 

H. SUMMARY OF THE INVENTION 

2. In accordance with the purposes of this invention, as embodied and broadly 

1 5 described herein, this invention, in one aspect, relates to hormone response element (HRE)- 
binding transregulators. 

3. Additional advantages of the invention will be set forth in part in the description 
which follows, and in part will be obvious from the description, or may be learned by 
practice of the invention. The advantages of the invention will be realized and attained by 

20 means of the elements and combinations particularly pointed out in the appended claims. It 
is to be understood that both the foregoing general description and the following detailed 
description are exemplary and explanatory only and are not restrictive of the invention, as 
claimed. 

III. BRIEF DESCRIPTION OF THE DRAWINGS 

25 4. The accompanying drawings, which are incorporated in and constitute a part of 

this specification, illustrate several embodiments of the invention and together with the 
description, serve to explain the principles of the invention. 

5. Figure 1 shows the construction and biochemical properties of ERE binding 
proteins. Figure 1A shows the cDNA for the C domain encoding residues 181-263 of ERa, 

30 and for the CD domain, residues 181-301, were joined for the corresponding cDNA of an 
ERE binding module, CC, CCD or CDC. The cDNA of an ERE binding module is flanked 
by multiple cloning sites (MCS) for the subsequent insertions of cDNAs for single or 
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multiple activation domains. Figure IB shows that cDNAs were transcribed and translated 
in vitro (TNT) in the presence of [ 35 S]-Met. Equal aliquots of reaction mixtures were 
subjected to 4-18% gradient SDS-PAGE followed by fluorography. V denotes the parent 
vector. Molecular weight markers in KDa are shown. Figure 1C shows the abilities of 
5 proteins synthesized in vitro in the presence of unlabeled Met to bind to the consensus ERE 
were assessed by EMSA. Equal aliquots of reaction mixtures were incubated with 0.05 nM 
[ 32 P] end-labeled DNA in the absence (-) or presence (+) of a Flag antibody and resolved by 
8% native PAGE. Free denotes the unbound radiolabeled ERE, while P-ERE indicates 
protein-ERE complexes. The ERa-ERE complex migrates as a doublet in EMSA, which is 

1 0 likely due to a truncated, post-translationally modified or monomelic ER that is capable of 
interacting with the ERE. This is also the case for the recombinant ERa observed in panel 
H. Figure ID shows the TNT reaction containing ERa or CDC shows no binding to the 
radiolabeled DNA fragment bearing a half-site ERE, S'-GGTCA-S*. Figure IE shows 
fluorographic detection of radiolabeled CDC, ERa, the DNA binding defective C*DC, or 

1 5 the DNA binding defective ERa*. Figure IF shows binding of equal amounts of TNT 
reaction containing CDC, C*DC, ERa or ERa* proteins to the ERE was assessed by 
EMSA. Figure 1G shows the identity of critical contact sites in the consensus ERE by ERa 
and CDC was assessed by the missing nucleoside hydroxyl radical assay followed by 
denaturing PAGE analysis. Lanes 1 and 6 indicate ERE containing no protein, whereas lane 

20 2 and 5 represent reactions containing CDC and ERa, respectively. Lanes 3 and 4 represent 
the Maxam-Gilbert reaction (G). The lane 7 (C) represents DNA subjected to hydroxyl 
radical treatment and the lane 8 indicates the uncut DNA (U) in the absence of protein. The 
ratio of free (F) to bound (B) DNA at each base was quantified and plotted. Ratios are 
represented as horizontal bars, the length of which approximates the strength of nucleoside 

25 contact with the protein. The ERE half-sites are boxed. A representative autoradiogram of 
several independent experiments is shown. Figure 1H shows the displacement of the 
radiolabeled ERE bound to CDC or ERa by unlabeled ERE. The construct-bound 
radiolabeled consensus ERE (0.05 nM) was incubated with 0, 0.125, 0.25, 0.5, 1, 2, 4 and 8 
nM unlabeled ERE. Bound and free fractions quantified by Phosphorlmager were used in 

30 the estimation of dissociation constant (Kd). A representative image of three independent 
experiments is shown. Figure II shows CDC competes with ERa to bind to ERE. ERa was 
incubated with the end-labeled consensus ERE for 30 min. The CDC at 1, 8 and 16-fold 
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more molar concentrations was then added into the reaction. Reactions were further 
incubated for 30 min and resolved with 8% non-denaturing PAGE. 

6. Figure 2 shows intracellular localization and transcription activation abilities of 
ERa and ERE binding proteins in transiently transfected COS-1 cells. Figure 2 A shows 

5 ERE binders, as ERa, are localized in the nucleus of transiently transfected COS-1 cells. 
Proteins were probed with a Flag antibody followed by a fluorescein-conjugated secondary 
antibody for visualization (FITC). DAPI staining indicates the nucleus. Figure 2B shows 
that in order to examine the transactivation abilities of constructs, cells were transfected 
with 300 ng expression vector per well bearing none (V) or cDNA for ERa, CDC, VP- 16, 

1 0 CDC-VP 1 6, ERa*, C*DC- VP 1 6, or CDC- VP 1 6* cDNA together with 1 25 ng of reporter 
plasmid. (*) denotes mutants with an impaired DNA binding function, while VP 16* depicts 
the activation function defective construct. The reporter plasmid contained either one (1 x 
ERE) or two copies (2 x ERE) of the consensus ERE in tandem located upstream of a 
simple promoter, TATA box, driving the expression of the firefly luciferase cDNA. The 

1 5 transfection efficiency was monitored by the co-expression of 0.5 ng of a reporter plasmid, 
pCMV-RL that drives the expression of Renilla luciferase cDNA. Cells were treated with 
or without 10" 9 M E2 for 24 h. The cell extracts were assayed for luciferase enzymes, and 
the normalized firefly I Renilla luciferase activities are presented as fold changes compare to 
the control, which was set to one. Shown are the mean + SEM of three independent 

20 experiments performed in duplicate. 

7. Figure 3 shows the effects of the promoter-type on the ability of an EBA to 
transactivate the ERE-driven reporter gene in COS-1 cells. Cells were transiently 
transfected with an expression vector bearing no cDNA (V), cDNA for ERa or for an EBA 
together with a single ERE-driven TATA box (ERE-TATA) or the thymidine kinase (ERE- 

25 TK) promoter for the expression of the firefly luciferase cDNA. Shown are the mean + 
SEM of three independent experiments performed in duplicate. 

8. Table 1 . The effect of increasing number of ADs and ERE on transactivation 
abilities of EBAs. COS-1 cells were transiently transfected with an expression vector 
bearing no cDNA (V) or cDNA for ERa, EBAs with single or two ADs of VP 16 or of p65 

30 fused to amino-, carboxyl- or both termini. Cells were also co-transfected with the reporter 
vectors bearing one (1 x ERE) or two copies (2 x ERE) of the consensus ERE located 
upstream of a simple TATA box promoter that drives the expression of the firefly luciferase 
cDNA. The mean + SEM represents three independent experiments performed in duplicate. 
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9. Table 2. Effects of the cell-type on transactivation abilities of EBAs. CHO or 
MDA-MB-231 cells were transiently transfected with an expression vector bearing no 
cDNA (V), cDNA for ERa or for an EBA. The reporter plasmid contained either one (1 x 
ERE) or two copies (2 x ERE) of the consensus ERE located upstream of the TATA box 

5 promoter driving the expression of the firefly luciferase cDNA. The mean + SEM of three 
independent experiments performed in duplicate indicates normalized luciferase values 
represented as fold change. 

10. Figure 4 shows transcriptional responses from non-consensus ERE and non-ERE 
sequences in COS-1 cells. Figure 4A shows cells were transfected with expression vectors 

10 bearing none (V), ERa, (VP16) 2 -CDC-(p65) 2 or (p65) 2 -CDC-(VP16)2 cDNA and treated 
with 10" 9 M E 2 for 24h. pS2, Oxytocin or Lactoferrin depicts 5 '-GGTC AcggTGGCC-3 5'- 
GGTGAcctTGACC-3' or 5 'GGTC AaggCG ATC-3 1 ERE sequence derived from the human 
pS2, oxytocin or lactoferrin gene ERE, respectively. An ERE drives the expression of the 
firefly luciferase cDNA as the reporter enzyme from the TATA box promoter. Figure 4B 

15 shows COS-1 cells were transfected with expression vectors bearing ERa, or (p65) 2 -CDC- 
(VP1 6^ cDNA together with the TATA box reporter vector. The reporter vector contains 
no (TATA) or two ERE half sites with 0, 1, 2, 3, 4, 5, 10 or 15 non-specific central 
nucleotides. Figures 4C and 4D show COS-1 cells were transfected with expression vectors 
bearing none (V), RXRa (Panel C) PR (Panel D), ERa, (VP16) 2 -CDC-(p65) 2 or (p65) 2 - 

20 CDC-(VP16) 2 cDNA. Cells were treated with 10" 6 M 9-cis-retinoic acid for RXRa, 10" 8 M 
Progesterone for PR or 10" 9 M E 2 for ERa for 24h. The TATA box reporter vector bears 
one consensus RXRE 5 AGGTC AnAGGTC A-3 (Panel C) or PRE 5'- 
AGAACAnnnTGTTCT-3* (panel D) that drives the expression of the reporter firefly 
luciferase cDNA. Normalized luciferase values represented as fold change are the mean + 

25 SEM of three independent experiments performed in duplicate. 

11. Figure 5 shows the effects of EBAs on transcriptional responses from ERE- 
dependent and -independent reporter constructs, and on cell cycle in MDA-MB-23 1 cells. 
(A&B) Cells were transfected with expression vectors bearing none (V), ERa, CDC, p65- 
CDC-VP16 or (p65) 2 -CDC-(VP16) 2 cDNA together with the pS2, C3 or Oxytocin (Oxy) 

30 promoter (A), or together with the Col or RARa promoter (B). All promoter constructs 

drive the expression of the firefly luciferase cDNA as the reporter enzyme. Cells were then 
treated without or with 10" 9 M E 2 (shown only for ERa) for 24h. Normalized luciferase 
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values represented as fold change are the mean + SEM of three independent experiments 
performed in duplicate. Figure 5C shows cells were transfected with pEGFP vector bearing 
none (EGFP), CDC, ERa or an EBA cDNA. One day after transfection, cells were 
subjected to a fluorescent-activated cell sorting (FACS) to separate EGFP -positive cell 
5 population. Cell cycle analysis was simultaneously performed with the EGFP-positive cells. 
The percentage of cells at Gl phase was assessed among treatment groups and expressed as 
percent change compared to cells transfected with EGFP vector bearing no cDNA. Shown 
is a representative experiment from several independent experiments. 

12. Figure 6 shows the effects of EBAs on transcriptional responses (A&B) from 
10 ERE-dependent and -independent reporter constructs, and on cell cycle (C) in MCF-7 cells. 

Transfection and processing of cells were done as described in legend of Figure 5. 

13. Figure 7 shows EBRs effectively repress the transcription of reporter gene from 
an ERE-containing heterologous promoter in COS-1 and CHO cells. To examine the 
transrepression abilities of constructs, cells were transfected with 75 ng expression vector 

15 per well bearing none (V) or cDNA for ERa, CDC, an EBR together with 125 ng of 

reporter plasmid. The reporter SV40 enhancer/promoter vector containing one consensus 
ERE juxtaposed to the promoter that drives the firefly luciferase cDNA expression. The 
transfection efficiency was monitored by the co-expression of 0.5 ng of a reporter plasmid, 
pCMV-RL that drives the expression of Renilla luciferase cDNA. Cells were treated with 

20 10" 9 M E2, 10" 7 M 4-OHT or 10" 9 M E2 + 10" 7 M 4-OHT for ERa, and without ligand for 
EBRs for 24 h. The cell extracts were assayed for luciferase enzymes, and the normalized 
firefly I Renilla luciferase activities are presented as fold changes compare to the control, 
which was set to 100. Shown are the mean + SEM of three independent experiments 
performed in duplicate. 

25 14. Figure 8 shows the effects of EBA to trans-repress the Catepsin D gene promoter 

in COS-1 cells. Transfection and processing of cells were done as described in legend of 
Figure 7. The reporter vector contains the human Catepsin D (CatD) gene promoter that 
drives the firefly luciferase cDNA expression. The mean + SEM indicates three 
independent experiments performed in duplicate. 

30 15. Figure 9 shows that EBAs trans-repress the expression of reporter gene from a 

reporter vector bearing the human C3 gene promoter that drives the firefly luciferase cDNA 
expression in COS-1 cells. Transfection and processing of cells were done as described in 
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legend of Figure 7. Shown are the mean + SEM of three independent experiments 
performed in duplicate and expressed as fold changes compare to the parent expression 
vector (V), which was set to one. 

16. Figure 10 shows in the upper panels tha the infection of MCF-7 and MDA-MB- 
5 231 cells with the recombinant adenovirus expressing the (5-galactosidase cDNA at different 

MOI. Cells were fixed, subjected to an in situ p-galactosidase assay for blue color 
development. Lower panels indicate the induction of P-galactosidase activity as a function 
of time in cells infected with the recombinant adenovirus expressing P-galactosidase cDNA 
at MOI of 500 for MCF-7 and 250 for MDA-MB-231 cells. Images were captured by CCD 
1 0 camera attached to a phase-contrast microscope. 

17. Figure 1 1 shows the regulation of endogenous E2 -responsive genes in MDA- 
MB-231 cells infected with recombinant adenovirus bearing no (CMV), ERa (E2-ERa) or 
p65-CDC-VP16 (PV) at MOI of 250 for 24h. ERa infected cells were treated with 10" 9 M 
E2 for 24h. The ARn indicates the intensity of fluorescence signal as a function of PCR 

15 cycle number. An experiment performed in duplicate is shown. 

1 8. Figure 12 shows the cell cycle progression in MDA-MB-23 1 cells. Cells were 
infected with the recombinant adenovirus expressing no (CMV) or the ERa in the presence 
of a physiological concentration, 10" 9 M, of E2, or PV cDNA. The data show that the 
transactivator PV, just as the E2-ERa, represses cell cycle progression in MDA-MB-231 

20 cells by blocking Gl/S phase transition. 

TV. DETAILED DESCRIPTION 

19. The present invention may be understood more readily by reference to the 
following detailed description of preferred embodiments of the invention and the Examples 
included therein and to the Figures and their previous and following description. 

25 20. Before the present compounds, compositions, articles, devices, and/or methods 

are disclosed and described, it is to be understood that this invention is not limited to 
specific synthetic methods, specific recombinant biotechnology methods unless otherwise 
specified, or to particular reagents unless otherwise specified, as such may, of course, vary. 
It is also to be understood that the terminology used herein is for the purpose of describing 

30 particular embodiments only and is not intended to be limiting. 
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A. Definitions 

21 . As used in the specification and the appended claims, the singular forms "a," 
"an" and "the" include plural referents unless the context clearly dictates otherwise. Thus, 
for example, reference to "a pharmaceutical carrier" includes mixtures of two or more such 

5 carriers, and the like. 

22. Ranges can be expressed herein as from ""about" one particular value, and/or to 
"about" another particular value. When such a range is expressed, another embodiment 
includes from the one particular value and/or to the other particular value. Similarly, when 
values are expressed as approximations, by use of the antecedent "about," it will be 

10 understood that the particular value forms another embodiment. It will be further 

understood that the endpoints of each of the ranges are significant both in relation to the N 
other endpoint, and independently of the other endpoint. It is also understood that there are 
a number of values disclosed herein, and that each value is also herein disclosed as "about" 
that particular value in addition to the value itself. For example, if the value "10" is 

15 disclosed, then "about 10" is also disclosed. It is also understood that when a value is 
disclosed that "less than or equal to" the value, "greater than or equal to the value" and 
possible ranges between values are also disclosed, as appropriately understood by the skilled 
artisan. For example, if the value "10" is disclosed the "less than or equal to 10"as well as 
"greater than or equal to 10" is also disclosed. It is also understood that the throughout the 

20 application, data is provided in a number of different formats, and that this data, represents 
endpoints and starting points, and ranges for any combination of the data points. For 
example, if a particular data point "10" and a particular data point 15 are disclosed, it is 
understood that greater than, greater than or equal to, less than, less than or equal to, and 
equal to 10 and 15 are considered disclosed as well as between 10 and 15. 

25 23. In this specification and in the claims which follow, reference will be made to a 

number of terms which shall be defined to have the following meanings: 

24. "Optional" or "optionally" means that the subsequently described event or 
circumstance may or may not occur, and that the description includes instances where said 
event or circumstance occurs and instances where it does not. 

30 25. "Primers" are a subset of probes which are capable of supporting some type of 

enzymatic manipulation and which can hybridize with a target nucleic acid such that the 
enzymatic manipulation can occur. A primer can be made from any combination of 
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nucleotides or nucleotide derivatives or analogs available in the art which do not interfere 
with the enzymatic manipulation. 

26. "Probes" are molecules capable of interacting with a target nucleic acid, typically 
in a sequence specific manner, for example through hybridization. The hybridization of 
5 nucleic acids is well understood in the art and discussed herein. Typically a probe can be 
made from any combination of nucleotides or nucleotide derivatives or analogs available in 
the art. 



10 from genes stored in DNA, to RNA, such as mRNA in eukaryotes, and then to proteins. 
While certain virsus, such as retroviruses, have a different cycle because their genetic 
material is stored in RNA, one of the most fundamental processes of life is the control of the 
movement of the information contained in genes to the expressed protein product encoded 
by those genes. There are many points of control along this pathway, including the point at 

15 which the DNA is transcribed to RNA, producing the molecule which will be used to 

produce the protein product. Control of this process is very important, as many diseases, 
most notably cancer, arise because of aberrant control of this process, wherein either the 
wrong gene is being expressed in the wrong cell or too much of a particular gene is being 
expressed, or not enough of a particular gene, such as a negative cell cycle regulator, is 

20 being expressed. 

28. The process of turning the information in the DNA into RNA, is called 
transcription. The transcription process is a very complex and involves many different 
proteins, including the basal transcription machinery. The basal transcription machinery 
comprises more than 30 proteins and polypeptides. The basal transcription machinery 

25 functions at the start site, the TATA box, and is responsible for actually decoding the DNA 
and synthesizing the corresponding RNA of a given gene. 

29. In addition to the basal transcription machinery, typically other proteins are 
required for adequate and controlled transcription. These other proteins are broadly referred 
to as transcription modulators or transcription factors, such as transcription activators or 

30 enhancers, or transcription repressors. Transcription modulators typically function by 



these regions are in the 5' 1 leader sequence of a gene. There are a number of different types 



B. Compositions and methods 

27. The central dogma of molecular biology posits that genetic information flows 




ion of the DNA associated with the gene called a DNA binding site. Typically 
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of DNA binding sites, but there are characteristics that many DNA binding sites share. For 
example, DNA binding sites typically are made up of a precise DNA sequence, i.e. 5'- 
GGTCA-3\ It is possible that DNA binding sites can be made up of a set of sequences, i.e. 
5'-GGTCA-3' or 5'-GGCCA-3\ that are related, but have some differences. These 
5 differences can arise because, the contacts needed for binding between the protein and the 
DNA, either do not involve the variant position or because the variant position provides the 
appropriate contact with more than one specific nucleotide. 

30. Many transcription modulators bind the DNA as dimers, meaning there are two 
modulators, which are typically interacting with each other, as well as interacting with the 

10 DNA. This generally allows increased affinity and specificity to take place between the 
modulator and the DNA. The affinity is generally determined by how many binding 
contacts are made between the modulator and the DNA. When there are two modulators, 
the number of binding contacts increases, not only between the modulators and the DNA, 
relative to a single modulator, but also binding contacts between the two modulators 

15 increase. Specificity arises from the type of contact, i.e. the orientation of the contacts 

between the modulator and the DNA. When modulators bind as dimers, they can do so as 
homodimers or heterodimers. A homodimer is a dimer that is formed between two identical 
transcription modulators, and a heterodimer is a dimer that is formed between two different 
transcription modulators. 

20 31. As indicated, transcription modulators bind DNA, and the site that they bind 

DNA at is called a DNA binding site. As indicated, transcription modulators typically bind 
DNA as a dimer, in the natural context (i.e. the modulator may bind as a monomer, but in 
the natural state of the cell, the amount of the dimer is insufficient to bind the site as a 
monomer because of the low affinity, but does bind as a dimer, because of the increased 

25 affinity). As indicated, a dimer comprises two transcription modulators, and each of these 
modulators binds DNA at a DNA binding site in the DNA, which has a specific sequence or 
set of sequences. Thus, when a dimer binds, there will be two DNA binding sites, one for 
each monomer of the dimer. The combination of two sites can be referred to as an element, 
such as an enhancer element. 

30 32. Typically transcription modulators are composed of different functional 

domains. Most transcription modulators have a DNA binding domain, as discussed above. 
In addition, transcription modulators also typically have a domain that imparts an effect on 
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the basal transcription machinery, i.e. enhancement or repression. These domains can be 
called activation domains or repressor domains. In addition, there can be domains, for 
example, that cause nuclear localization, interact with other proteins, such as other 
modulators, or which interact with regulators of the modulator themselves. There can also 
5 be domains that connect the various domains of the modulator. Another type of domain is a 
dimerization domain. A dimerization domain, is a region of a transcription modulator that 
facilitates the interaction between two modulators, to form a dimer. Thus, the dimerization 
of two transcription modulators occurs through non-covalent interactions. 

33. There is a need for engineered transcriptional modulators. This need exists 

10 because it can allow the designer control of transcription for genes of choice. Furthermore, 
there is a need to be able to identify sets of genes that are activated by the same modulators. 
Disclosed herein are engineered transcriptional modulators. In certain embodiments these 
engineered transcriptional modulators are called linked modulators. Linked modulators are 
composed of at least two DNA binding domains that are covalently linked. Linked 

15 modulators can also have at least one other domain which modulates transcription, called a 
modulation domain. In certain embodiments these linked modulators can have the effect of 
creating constitutive transcriptional activation from a defined element that typically would 
not be constitutively active because the typical transcriptional dimer modulator for the 
element site is not a constitutive activator, but for example, requires that the modulator 

20 interact with another molecule or protein before activation takes place. Disclosed are 
properties of the linked modulators and uses of the linked modulators. 
1. Breast cancer treatments 
a) Endocrine therapy 

34. Although complementation and convergence of various signaling pathways are 
25 ultimately responsible for the physiology and pathophysiology of breast tissue, it is clear 

that estrogens are primary agents in the development of most breast cancers by stimulating 
and maintaining malignant cell proliferation. Consequently, measures that perturb the 
estrogen environment of the tumor cells by blocking the synthesis of estrogen or by 
preventing estrogen actions are current strategies for therapeutic intervention for the 
30 neoplasm. The management of early breast cancer is primarily based on surgical removal of 
the tumor by mastectomy or lumpectomy without or with radiotherapy, followed by an 
adjuvant systemic therapy dependent upon the ER status. 
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(1) GnRH antagonist 

35. GnRH regulates the synthesis and secretion of LH and FSH from the anterior 
pituitary (Shalev, E. et al. (2003) J Obstet Gynaecol Can 25, 98-1 13). GnRH-stimulated 
gonadotropin secretion can be blocked with antagonists as well as agonists whose sustained 

5 delivery induces pituitary desensitization (Limonta, P.et al. (2001) Expert Opin Investig 
Drugs 10, 709-720). These compounds ultimately reduce the circulating levels of 
gonadotropins and subsequently gonadal steroid hormone synthesis and secretion. Termed 
medical castration, this effect is exploited in the treatments of sex hormone-dependent 
neoplasms that also include breast (Robertson, J. F. et al. (2003) Eur. J. Cancer 39, 861-869; 

10 Grundker, C et al. (2003) Reprod Biol Endocrinol 1, 65). The GnRH agonist, goserelin, 
remains the treatment of choice for pre-menopausal patients with ER-positive breast 
cancers. It appears that a combination of goserelin and antiestrogenic compounds to 
produce an estrogen blockade is a more effective treatment regimen in prolonging 
progression-free survival than the use of a GnRH agonist alone (Robertson, J. F. et al. 

15 (2003) Eur. J. Cancer 39, 861-869; Grundker, C. et al. (2003) Reprod Biol Endocrinol 1, 
65). 

(2) Aromatase Inhibitors 

36. Since, as described above, estrogens are synthesized from androgenic steroid 
substrates by the aromatase enzyme, an effective perturbation of enzyme activity provides 

20 the most specific effects on estrogen production. Two major classes of aromatase inhibitors 
have been developed and are currently in clinical use. Type 1 inhibitors are steroidal 
analogues of androstenedione and bind to the same site as androstenedione on the aromatase 
molecule. However, unlike androstenedione these analogues bind to the enzyme 
irreversibly and covalently, because of their conversion to reactive intermediates by 

25 aromatase (Simpson, E. R. et al. (2002) Recent Prog. Horm. Res. 57, 3 17-338; Santen, R. J. 
(2002) J. Clin. Endocrinol. Metab. 87, 3007-3012). Therefore, Type 1 inhibitors are now 
commonly known as enzyme inactivators that include formestane and exemestane. Since 
the recovery of enzyme activity depends on both the re-synthesis of enzyme and the 
pharmacokinetics of the drug, these types of inhibitors have the potential for selectivity for 

30 the enzyme target and long-term effectiveness. However, such steroidal structures also have 
the potential for hormonal activity (Simpson, E. R. et al. (2002) Recent Prog. Horm. Res. 
57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. Metab. 87, 3007-3012). 
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37. Type II inhibitors are non-steroidal compounds that are triazoles and include 
anastrozole and letrozole. These type II inhibitors bind reversibly to the enzyme and fit into 
the substrate-binding site such that azole nitrogens interact with the heme prosthetic group 
in the aromatase enzyme with high affinity and specificity (Simpson, E. R. et al. (2002) 

5 Recent Prog. Horm. Res. 57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. Metab. 87, 
3007-3012). 

38. Aromatase inhibitors are not effective in pre-menopausal women, as lower 
circulating levels of estrogen could result in the stimulation of the hypothalamo- 
hypophyseal axis activity, which in turn increases circulating estrogen levels by enhancing 

10 estrogen synthesis from the ovaries (Simpson, E. R. et al. (2002) Recent Prog. Horm. Res. 
57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. Metab. 87, 3007-3012). Thus, 
application of aromatase inhibitors to treatment of pre-menopausal women with breast 
cancer is limited to their combined usage with goserelin. Since, however, the primary 
source of estrogen in post-menopausal women is the conversion of adrenal C19 steroids into 

15 estrogens by intra-tumor as well as extra-gonadal sites of aromatase activity, aromatase 

inhibitors constitute an effective therapeutic intervention for breast cancers (Simpson, E. R. 
et al. (2002) Recent Prog. Horm. Res. 57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. 
Metab. 87, 3007-3012). Studies indicate that aromatase inhibitor therapy leads to a 
precipitous drop in the intratumoral concentrations of estrogens together with a 

20 corresponding loss of intratumoral aromatase activity (Simpson, E. R. et al. (2002) Recent 
Prog. Horm. Res. 57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. Metab. 87, 3007- 
3012). Clinical trials have provided further support for the use of the aromatase inhibitors 
as first line treatment of ER positive breast cancers in post-menopausal women (Simpson, 
E. R. et al. (2002) Recent Prog. Horm. Res. 57, 317-338; Santen, R. J. (2002) J. Clin. 

25 Endocrinol. Metab. 87, 3007-3012). Since, however, aromatase inhibitors inhibit aromatase 
activity globally, these compounds could affect many other tissues wherein estrogens are 
required for normal function. The development of tissue-specific aromatase inhibitors could 
expand the utility of this approach in the treatment of breast cancers (Simpson, E. R. et al. 
(2002) Recent Prog. Horm. Res. 57, 317-338; Santen, R. J. (2002) J. Clin. Endocrinol. 

30 Metab. 87, 3007-3012). 
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(3) Antiestrogens 

39. In addition to estrogen, ER also binds compounds that act as estrogen 
competitors (McDonnell, D. P. (1999) Trends Endocrinol Metab 10, 301-311; Jordan, V. C. 
et al. (1999) Endocr. Rev. 20, 253-278; Jensen, E. V. et al. (2003) Clin Cancer Res 9, 1980- 

5 1989). These compounds can be divided into two categories: Type I and II. Type I 
compounds include tamoxifen, toremifene and raloxifene and are now referred to as 
selective estrogen receptor modulators, SERMs. Tamoxifen and toremifene have a 
triphenylethylene structure and raloxifene has a benzothiophene structure. Although the 
primary structure of these SERMs differs significantly from that of estrogen which is a 

10 cyclophenanthrene, they have conformations that allow them to bind to ERs. SERMs can 
function as agonists or antagonists depending on ER subtypes, and the cells and tissues in 
which they operate (McDonnell, D. P. (1999) Trends Endocrinol Metab 10, 301-311; 
Wakeling, A. E. (2000) Endocr Relat Cancer 7, 17-28). Tamoxifen and raloxifene function 
as antagonists in breast. While tamoxifen acts as an agonist in the uterus, bone and 

15 cardiovascular system, raloxifen functions as a pure antagonist in the uterus but an agonist 
in bone. 

40. Type II compounds that include steroidal compounds ICI 164,384 and ICI 
182,780 are derivatives of estrogen with long alkyl 7a-substitutions and are considered as 
pure antagonists devoid of estrogenic activity in most experimental systems tested 

20 (McDonnell, D. P. (1999) Trends Endocrinol Metab 10, 301-31 1; Wakeling, A. E. et al. 
(2001) Clin Cancer Res 7, 4350s-4355s; discussion 441 ls-4412s). The distinct 
pharmacological properties of these antiestrogens allow treatment regimens to be targeted to 
a specific tissue of interest to minimize unintended development of other tissue 
malignancies. 

25 41. Biochemical, functional and structural studies have indicated that antiestrogens 

alter the conformation of the carboxyl-terminal regions of ERs (McDonnell, D. P. (1999) 
Trends Endocrinol Metab 10, 301-31 1; Wakeling, A. E. et al. (2001) Clin Cancer Res 7, 
4350s-4355s; discussion 441 ls-4412s). Ligand binding is accompanied by a major 
reorganization in the tertiary structure of the LBD. Key differences in receptor 

30 conformation in the presence of different ligands are an indication for a structural basis for 
antagonism. Agonist binding induces a conformational change in which the carboxyl 
terminal helix 12 (HI 2), containing the core region of AF2, is aligned over the ligand- 
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binding cavity that is composed of helices 3, 5/6, and 1 1 . This alignment results in the 
formation of a specific binding site for the consensus LXXLL motif of co-activators. 
Binding of the Type 1 antagonists to ER sterically interferes with HI 2 positioning in that 
HI 2 interacts with a hydrophobic groove composed of residues from helices 3 and 5. This 
5 distinct orientation of H12 partially buries residues in the groove necessary for AF-2 

activity, thereby preventing co-factor recruitment (Brzozowski, A. M. et al. (1997) Nature 
389, 753-758; Pike, A. C. et al. (1999) EMBO J. 18, 4608-4618). 

42. In ICI-bound ER, the side chain of ICI completely prevents HI 2 from associating 
with the LBD. This disordered conformation is thought to lead to full antagonism that 

10 results in the destabilized ER structure leading to disruption of nuclear-cytoplasmic 

shuttling and increased receptor turnover (Dauvois, S. et al. (1992) Proc Natl Acad Sci U S 
A 89, 4037-4041; Dauvois, S. et al. (1993) J. Cell Sci. 106 ( Pt 4), 1377-1388). ICI 182,780 
(Faslodex) is approved as a "second-line" hormonal therapy for post-menopausal women 
with ER-positive metastatic breast cancer (Howell, A. et al. (2000) Cancer 89, 817-825). 

15 43. Although blocking the AF-2 function by antagonists suggests a passive role for 

the prevention of ER-mediated transactivation by anti estrogens, an active repression of gene 
transcription appears to be involved. Tamoxifen-ER is shown to recruit the co-repressors 
NCoR, SMRT (Lavinsky, R. M. et al. (1998) Proc. Natl. Acad. Sci. USA 95, 2920-2925; 
Shang, Y. et al. (2000) Cell 103, 843-852) and REA (Delage-Mourroux, R. et al. (2000) J. 

20 Biol. Chem. 275, 35848-35856) to the promoters of estrogen responsive genes. The 
subsequent recruitment of histone deacetylases (HDACs) to the repressor-ER complex 
causes deacetylation of histone proteins. This event leads to chromatin compaction and 
transcriptional repression. 

44. How does a SERM display partial agonist activity in an ER subtype and cell 

25 context dependent manner? The partial agonist activity of an antagonist is manifested as 
transcriptional responses from ERE-dependent genomic signaling pathway that are 
s ignifi cantly lower than those observed with the estrogen-ER complex. The partial 
agonistic effect of SERMs, particularly tamoxifen, bound ERot, but not ER0, from the ERE- 
dependent signaling pathway is modulated through the amino terminal AF-1 (Berry, M. et 

30 al. (1990) EMBO J. 9, 281 1-2818; Yi, P. et al. (2002) Mol. Endocrinol. 16, 1810-1827). It 
appeanrthat although the binding of a SERM to ERot prevents the AF-2 domain of the 
receptor from interacting with co-factors, the ability of the AF-1 domain to recruit the pi 60 
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family of co-factors in a cell-context dependent manner provides a mechanism for the 
partial agonistic effect of an antagonist for ERa (Yi, P. et aL (2002) Mol. Endocrinol. 16, 
1810-1827 ; Webb, P. et al. (1998) Mol. Endocrinol. 12, 1605-1618 ; Yi, P. et al.(2002) 
Mol. Endocrinol. 16, 674-693). Studies have shown that the tamoxifen-bound ERa recruits 
5 co-repressors, but not co-activators, to target promoters in breast cancer cells (Lavinsky, R. 
M. et al. (1998) Proc. Natl. Acad. Sci. USA 95, 2920-2925; Shang, Y. et al. (2000) Cell 103, 
843-852; Shang, Y. et al. (2002) Science 295, 2465-2468 ; Lee, E. J. et al. (2001) Mol. Med. 
7, 773-782). On the other hand, the tamoxifen-ERct complex interacts preferentially with 
the pi 60 family co-activators as well as co-repressors to target promoters to stimulate 

10 transcription in cells derived from endometrium (Shang, Y. et al. (2002) Science 295, 2465- 
2468). This allows the tamoxifen-ERct complex to induce transcription, albeit at lower 
levels than estrogen-ERoc, from estrogen responsive genes. Since the relative and absolute 
levels of expression of co-regulators vary among estrogen target cells, a balance between 
cell specific co-activators and co-repressors recruited by the antagonist-ERa complex 

15 appears to underlie the tissue selective pharmacology of SERMs (Shang, Y. et al. (2002) 
Science 295, 2465-2468; McKenna, N. J. et al. (1999) Endocr. Rev. 20, 321-344). 

45. It should be noted that anti estrogens could also affect the function of intracellular 
proteins and signaling independently from ER signaling pathways. These include changes 
in oxidative stress responses, activation of specific protein kinase C isoforms as well as 

20 alterations in calmodulin function and in cell membrane structure/function (Clarke, R. et al. 
(2001) Pharmacol. Rev. 53, 25-71). 

b) Endocrine resistance 

46. Current therapeutic approaches for breast cancer treatment utilize endocrine 
measures to counteract the effects of estrogens and are often successful in the remission of 

25 tumors (Nicholson, R. I. et al. (2000) Br. J. Cancer 82, 501-513; Clarke, R. et al. (2001) J. 
Steroid Biochem. Mol. Biol. 76, 71-84; Nicholson, R. I. et al. (2003) Breast Cancer Res. 
Treat. 80 Suppl 1, S29-34; discussion S35; Clarke, R. et al. (2003) Oncogene 22, 7316- 
7339). However, one-third of breast cancers fails to respond to endocrine therapy {de novo 
endocrine resistance). Moreover, the beneficial effects of antiestrogens are counteracted by 

30 the capacity of tumor cells to eventually circumvent such therapies, allowing the tumor cells 
to resume growth (acquired endocrine resistance). 

(1) De novo endocrine resistance 
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47. The most important factor in de novo resistance to endocrine therapies is the lack 
of ER expression. However, the ontology of de novo endocrine resistance cells is unclear. 
These populations could stem from ERa-negative epithelial cells that acquire autonomous 
growth properties. It is also possible that mitogenic changes in non-proliferate and ERa 
5 positive epithelial cells give rise to a phenotype that gains autonomous growth but loses its 
ability to express the ERa gene. Although the status of the ERp gene expression remains 
unknown in de novo resistant phenotypes, genetic alterations such as homozygous deletion, 
loss of heterozygosity or ERa gene mutation have not been reported to play a major role in 
the absence or loss of ER expression. Epigenetic control of ERa gene expression, on the 

10 other hand, appears to be critical for the absence/loss of the ERa gene transcription. CpG 
dinucleotides are frequently clustered into CpG islands and are often found in the promoters 
of genes (Chen, D. et al. (1999) Science 284, 2174-2177; Yang, X. et al. (2001) Endocr 
Relat Cancer 8, 115-127). Methylation of cytosines in these islands is associated with the 
repression of gene transcription (Chen, D. et al. (1999) Science 284, 2174-2177; Yang, X. et 

15 al. (2001) Endocr Relat Cancer 8, 1 15-127). Studies have indicated that the ERa gene 

contains CpG islands in its promoter and first exon (Falette, N. S. et al. (1990) Cancer Res. 
50, 3974-3978; Ottaviano, Y. L. et al. (1994) Cancer Res. 54, 2552-2555). These ERa CpG 
islands are unmethylated in normal breast tissue and ERa-positive tumor lines but they are 
methylated in about half of primary breast cancers and most ER-negative breast cancer cell 

20 lines (Ottaviano, Y. L. et al. (1994) Cancer Res. 54, 2552-2555; Piva, R. et al. (1989) 
Biochemistry International 19, 267-275). The methylation status of CpG islands is 
associated with reduced or absent ERa expression, consequently cessation of ER protein 
synthesis (Ottaviano, Y. L. et al. (1994) Cancer Res. 54, 2552-2555; Piva, R. et al. (1989) 
Biochemistry International 19, 267-275). DNA methylation is regulated by the members of 

25 DNA-cytosine methyltransferase (DNMT) family (Chen, D. et al. (1999) Science 284, 2174- 
2177; Yang, X. et al. (2001) Endocr Relat Cancer 8, 1 15-127). Studies have shown that 
methyltransferase inhibitors cause partial de-methylation and restoration of ERa mRNA 
expression and synthesis of functional ERa protein (Ferguson, A. T. et al. (1995) Cancer 
Res. 55, 2279-2283). A disregulated expression of DNMT in ERa-negative breast cancer 

30 cell lines is proposed to be associated with the ER-gene repression (Yang, X. et al. (2001) 
Endocr Relat Cancer 8, 1 15-127). 
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48. Methylation of the ERa gene is required but may not be sufficient for ERa gene 
repression. It appears that the acetylation status of the ERa gene also contributes to ERa 
gene silencing (Yang, X. et al. (2000) Cancer Res. 60, 6890-6894). Studies showed that an 
increase in the acetylation of histones and de-rnethylation of the ER CpG islands 
5 synergistically activate ERa expression (Yang, X. et al. (2001) Cancer Res. 61, 7025-7029). 
This suggests that DNMT and HDAC are key regulators of methylation-mediated ERa 
gene silencing. These findings also imply that DNMT and HDAC inhibitors could be 
potentially important in establishing hormone responsiveness, and consequently in breast 
cancer treatment. 

10 49. The underlying mechanisms for the methylation and acetylation status of the 

ERa gene promoter are unclear. Studies showed that the activation of the growth factor 
signaling pathways in breast cancer cells results in down-regulation of ERa gene expression 
(Pietras, R. J. et al. (1995) Oncogene 10, 2435-2446; Kumar, R. et al. (1996) J. Cell. 
Biochem. 62, 102-1 12; Tang, C. K. et al. (1996) Cancer Res. 56, 3350-3358) through, at 

15 least in part, an enhanced deacetylase activity (Mazumdar, A. et al. (2001) Nat Cell Biol 3, 
30-37). It is therefore possible that aberrant growth factor signaling is involved in the 
absence or loss of ER gene expression. Additionally, altered expression of transacting 
factors responsible for ERa transcription and/or abnormalities in post-transcriptional and 
translational processing of ERa could also contribute to the absence of ER synthesis 

20 (Weigel, R. J. et al. (1993) Cancer Res. 53, 3472-3474 ; Ferguson, A. T. et al. (1997) Crit. 
Rev. Oncog. 8, 29-46; Ferguson, A. T. et al. (1998) Cancer Treat. Res. 94, 255-278). 

50. Whatever the underlying mechanisms for the absence or loss of the ERa gene 
expression might be, an autonomous regulation of cell growth defines de novo resistance 
malignancies. Several growth factors and their receptors that include EGF, FGF, IGF, and 

25 TGF families have been shown to be over-expressed and to act as autocrine growth 

stimulators for breast cancer cells (Nicholson, R. L et al. (2000) Br. J. Cancer 82, 501-513; 
Clarke, R. et al. (2001) J. Steroid Biochem. Mol. Biol. 76, 71-84; Clarke, R. et al. (2003) 
Oncogene 22, 7316-7339). Increased expression of growth factor receptors correlates with 
the severity of the disease (Nicholson, R. I. et al. (2000) Br. J. Cancer 82, 501-513; Clarke, 

30 R. et al. (2001) J. Steroid Biochem. Mol. Biol. 76, 71-84; Clarke, R. et al. (2003) Oncogene 
22, 7316-7339). Receptors for growth factors are trans-membrane tyrosine kinases that are 
linked to activation of MAPK and/or AKT signaling pathways critical for cellular 
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transformation, cancer progression and resistance to endocrine therapy (Nicholson, R. L et 
al. (2000) Br. J. Cancer 82, 501-513; Clarke, R. et al. (2001) J. Steroid Biochem. Mol. Biol. 
76, 71-84; Clarke, R. et al. (2003) Oncogene 22, 7316-7339). Disrupting signal 
transduction by specifically modulating the activity of these trans-membrane tyrosine 
5 kinases, therefore, constitutes an important strategy in the development anticancer agents. 
This includes antibody therapy to block ligand binding to the receptors and administration 
of small molecule tyrosine kinase inhibitors to inhibit receptor tyrosine kinase activity. 

5 1 . The EGFR belongs to a family of tyrosine kinases that contains human epidermal 
growth factor receptor-1 (or HER1), HER2, HER3, and HER4 (Yarden, Y. (2001) Oncology 

10 61 Suppl 2, 1-13). Receptor activation is mediated by homo- and hetero-dimerization 

among all four HER family members upon binding to various ligands. Dimerization results 
in receptor tyrosine phosphorylation that allows the binding of downstream signaling 
molecules leading to the activation of kinases. Heterodimerization of HERs provides 
further diversification and specificity of signal transduction. Moreover, many other growth 

15 factor receptors can phosphorylate and activate HERs. HERs also act as a conduit for 

multiple other signaling pathways through trans-phosphorylation. HER2 is over-expressed 
in approximately 30% of breast cancers with adverse clinical prognosis (Slamon, D. J. et al. 
(1 989) Science 244, 707-712). Trastuzumab is a novel humanized monoclonal antibody 
that binds to the extracellular domain of HER2 (Modi, S. et al. (2002) Curr Oncol Rep 4, 

20 47-55). This leads to receptor down-regulation, degradation and consequently to inhibition 
of cell growth. Trastuzumab is currently being used in clinical settings for the treatment of 
patients with HER2-positive metastatic breast cancer with significant benefits as 
monotherapy or in combination with chemotherapy (Vogel, C. L. et al. (2002) J. Clin. 
Oncol. 20, 719-726; Slamon, D. J. et al. (2001) N. Engl. J. Med. 344, 783-792). Similarly, a 

25 humanized monoclonal antibody BX-EGF that targets the extracellular domain of HER1 has 
entered clinical trials for breast cancer treatments (Modi, S. et al. (2002) Curr Oncol Rep 4, 
47-55). 

52. Small molecule compounds compete for the ATP-binding sites of the tyrosine 
kinase domains of the HER- family. Binding of these compounds to the receptor block the 

30 activation of the tyrosine kinase domain and subsequently prevent the downstream signaling 
cascades that include MAPK and AKT pathways (Modi, S. et al. (2002) Curr Oncol Rep 4, 
47-55 ; Arteaga, C. L. et al. (2002) Semin. Oncol. 29, 4-10 ; Goel, S. et al. (2002) Curr 
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Oncol Rep 4, 9-19). The two most clinically advanced compounds in this class of agents are 
ZD1839 and OSI-774 that specifically target HER1, whereas CI-1033 interacts with all four 
members of the HER-family. In pre-clinical models ZD 1 839 displays antiproliferative 
activity by interfering with cell cycle progression in a wide range of HER-expressing cancer 
5 cell lines (Sliwkowski, M. X. et al. (1999) Semin. Oncol. 26, 60-70). ZD1839 also 

augments the antitumor effects of chemo- and radiation-therapies (Modi, S. et al. (2002) 
Chit Oncol Rep 4, 47-55; Arteaga, C. L. et al. (2002) Semin. Oncol. 29, 4-10; Goel, S. et al. 
(2002) Curr Oncol Rep 4, 9-19). However, recent clinical trials in patients with refractory 
metastatic breast cancer, suggest that EGFR inhibitor ZD1839 has no clinical activity 

10 (Arteaga, C. L. et al. (2004) Semin. Oncol. 31, 3-8). Pharmacodynamic studies (Arteaga, C. 
L. et al. (2004) Semin. Oncol. 31, 3-8) also indicate that the activated EGFR in breast tumor 
cells is indeed blocked by EGFR tyrosine kinase inhibitors but without an associated 
reduction in tumor cell proliferation. These results imply that 1) levels of P-EGFR do not 
predict for EGFR dependence nor sensitivity to therapeutic EGFR blockade, and 2) drug- 

1 5 induced inhibition of P-EGFR is not predictive of response to treatment either. 

(2) Acquired endocrine resistance 
53. Counteraction of the beneficial effects of endocrine approaches by the tumor 
cells that express ER leads to acquired endocrine resistance phenotypes, in which the cells 
are no longer growth inhibited by antiestrogens (Nicholson, R. I. et al. (2000) Br. J. Cancer 

20 82, 501-513; Clarke, R. et al. (2001) J. Steroid Biochem. Mol. Biol. 76, 71-84; Nicholson, 
R. L et al. (2003) Breast Cancer Res. Treat. 80 Suppl 1, S29-34; discussion S35; Clarke, R. 
et al. (2003) Oncogene 22, 7316-7339). It is certain that endocrine resistance is multi- 
factorial. Since breast cancers display a remarkable phenotypic heterogeneity as a result of 
distinct gene expression profiles (Perou, C. M. et al. (2000) Nature 406, 747-752; Sorlie, T. 

25 et al.(2001) Proc Natl Acad Sci U S A 98, 10869-10874), each cancer type likely utilizes a 
different resistance mechanism. Nonetheless, aberrations in ER signaling pathways appear 
to be critical events that drive the response and resistance to antiestrogens. A rise in the 
population of ER mutants as ligand-independent, constitutively active or dominant-negative 
phenotypes, is postulated to contribute to the endocrine resistance of tumors (Murphy, L. C. 

30 et al. (1997) Ann. Med. 29, 221-234; Leygue, E. et al. (1998) Cancer Res. 58, 3197-3201). 
Despite the fact that ERct and ER(3 possess similar structural and biochemical properties, 
they display distinct activation properties for the expression of estrogen responsive genes. 
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An alteration in the relative levels of ERct and ERP when co-synthesized could, therefore, 
contribute to endocrine resistance by offsetting the balance between the regulatory potentials 
of ER-subtypes (Lazennec, G. et al. (2001) Endocrinology 142, 4120-4130 ; Speirs, V. et aL 
(1999) Cancer Res. 59, 525-528 ; Speirs, V. et al. (1999) Cancer Res. 59, 5421-5424). 
5 Aberrations in signaling pathways converging onto ER (post-translational processing) 
and/or ER-mediated events (promoter cross-talk) could also contribute to resistance by 
altering the sensitivity of ligand-ER mediated events or by circumventing the need for 
ligand-driven cell-growth (Kato, S. et al. (1998) Oncology 55 Suppl 1, 5-10; Nicholson, R. 
I. et al. (1999) Endocr Relat Cancer 6, 373-387). 

10 54. Alterations in co-regulator expression or availability could also be one 

mechanism for the development of endocrine resistance. Tamoxifen resistance is 
characterized not only by the ineffectiveness of the compound to inhibit tumor growth but 
also by a gained ability to act as a partial agonist in breast cells. Co-regulatory proteins are 
present at rate-limiting levels in cells such that modification in the level of co-regulator 

15 expression or activity could lead to alterations in the ER signaling, consequently endocrine 
resistance (Shang, Y. et al. (2002) Science 295, 2465-2468). As discussed above, the 
- transcriptional activity of the tamoxifen-ERa complex is modulated by the ratio between co- 
activator and co-repressor recruited to the complexes in cells within which tamoxifen acts as 
a partial agonist (Fujita, T. et al. (2003) J. Biol. Chem. 278, 26704-26714). A decrease in 

20 the level or activity of co-repressors (Lavinsky, R. M. et al. (1998) Proc. Natl. Acad. Sci. 
USA 95, 2920-2925; Graham, J. D. et al. (2000) J. Steroid Biochem. Mol. Biol. 74, 255- 
259; Graham, J. D. et al. (2000) Steroids 65, 579-584) with or without a concurrent increase 
in the level of co-activators (Hudelist, G. et al. (2003) Breast Cancer Res. Treat. 78, 193- 
204; Font de Mora, J. et al. (2000) Mol. Cell. Biol. 20, 5041-5047) could therefore play a 

25 critical role in the development of tamoxifen resistance in ER positive breast cancers. 

55. Studies showed that ERa positive breast cancer cells that are resistant to the 
growth-inhibitory effects of tamoxifen remain sensitive to growth inhibition by ICI 182,780 
i n experime ntal models in situ (Clarke, R. et al. (2001) Pharmacol. Rev. 53, 25-71; Brunner, 
N..et.al. (1993) Cancer Res. 53, 3229-3232). It is likely that the ability of ICI 1 82,780 to 

30 promote monomerization of ER and subsequent degradation by preventing the 

nuclear/cytoplasm shuttling of ER is the basis for its effectiveness as an antiestrogen. This 
interpretation is also consistent with second-line endocrine responses in patients who had 
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relapsed on tamoxifen but responded to ICI 182,780 (Howell, A. et al. (1996) Br. J. Cancer 
74, 300-308). It is unknown whether patients undergoing ICI 182,780 treatment develop 
resistance to the compound. However, the continuous long-term exposure of estrogen 
responsive breast cancer cells that are initially growth inhibited by ICI 182,780 develop 
5 resistance to the compound (Larsen, S. S. et al. (1997) Int. J. Cancer 72, 1 129-1 136; 

Brunner, N. et al. (1997) Cancer Res. 57, 3486-3493), as observed with experimental cell 
models (for a review see: [Simpson, E. R. et al. (2002) Recent Prog. Horm. Res. 57, 317- 
338]). This appears, at least in part, to be due to the re-bounding synthesis of ERa with a 
concomitant increase in responsiveness to estrogens (Larsen, S. S. et al. (1997) Int. J. 

10 Cancer 72, 1 1 29-1 136). The regulation of ERa gene expression involves activity of several 
distinct promoters whose activities are mediated by AP-1, AP-2, and estrogen receptor 
factor 1 (ERF-1) binding sites that interact with a member of the AP-2 family proteins 
(Tang, Z. et al. (1997) Mol. Cell. Biol. 17, 1274-1280; deConinck, E. C. et al. (1995) Mol. 
Cell. Biol. 15, 2191-2196; Tanimoto, K. et aL (1999) Nucleic Acids Res. 27, 903-909). 

15 Similarly, Alu ERE, Oct-1, AP-1 and SP-1 sites regulate the expression of the ERp gene 
(Li, L. C. et al. (2000) Biochem. Biophys. Res. Commun. 275, 682-689). There is evidence 
that both ERs also auto-regulate their own transcription (Castles, C. G. et al. (1997) J. 
Steroid Biochem. Mol. Biol. 62, 155-163; Vladusic, E. A. et al. (2000) Oncol Rep 7, 157- 
167). It is likely that modulation of the synthesis or activity of transacting factors 

20 responsible for the ER expression could be responsible for re-bounding/increase expression 
of ERs. This, together with findings that ICI 182,780 treatment can also lead to cross- 
resistance to tamoxifen (Brunner, N. et al. (1997) Cancer Res. 57, 3486-3493), indicates that 
the estrogen-mediated ER signaling participates in the development of acquired endocrine 
resistance. This reinforces expectations that inhibition of estrogen biosynthesis by 

25 aromatase inhibitors or by GnRH analogs together with antiestrogenic compounds could 
provide more effective treatment regimens for hormone responsive breast cancer. 

56. As in endocrine de novo resistance, growth factor signaling pathways become 
up-regulated and/or activated in resistant breast cancer cells, which show an increased 
dependence on growth factor signaling pathways as an adaptive mechanism (Yarden, Y. 

30 (2001) Oncology 61 Suppl 2, 1-13). Therefore, blockage or inhibition of growth factor 

signaling pathways in acquired endocrine resistance could also provide a basis for treatment. 
Indeed, in situ, in vivo, and clinical studies clearly indicate that inhibition of a variety of 
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growth factor-mediated signaling processes is effective in the prevention of endocrine- 
resistant phenotypes (Nicholson, R. I. et al. (2001) Endocr Relat Cancer 8, 175-182; Jeng, 
M. H. et al. (2000) Breast Cancer Res. Treat. 62, 167-175). The efficacy of anti-growth 
factor modalities can be further enhanced by combined treatments involving estrogen 
5 synthesis inhibitors and/or antiestrogens (Wakeling, A. E. et al. (2001) Clin Cancer Res 7, 
4350s-4355s; discussion 4411s-4412s). 

c) Gene based approaches for breast cancer 

57. Estrogens and growth factor signaling pathways mediate many interdependent 
events optimal for cell growth and survival. In the absence or loss of regulatory inputs from 

10 estrogen signaling due to deficiencies in ER expression/function, aberrations in signaling 
pathways mediated by growth factors appear to be central to de novo endocrine resistant 
cancers. Targeted inhibition of the growth factor signaling cascade is a target in the 
treatment of de novo resistant breast cancers. In addition, induction of ER expression/re- 
expression could provide a novel therapeutic benefit by restoring functional estrogen 

15 signaling. Consistent with this premise, experimental studies have shown that re- 
introduction of ERa or of ERp into ER-negative breast cancer cell models by stable 
transfections or by an adenoviral gene delivery re-establishes some features of estrogen 
responsiveness, decreases proliferation, motility and invasiveness of tumor cells (Lazennec, 
G. et al. (2001) Endocrinology 142, 4120-4130 ; Kushner, P. J. et al. (1990) Mol. 

20 Endocrinol. 4, 1465-1473 ; Garcia, M. et al. (1992) Proc Natl Acad Sci U S A 89, 1 1538- 
11542; Zajchowski, D. A. et al. (1993) Cancer Res. 53, 5004-5011; Catherino, W. H. et al. 
(1995) Cancer Lett. 90, 35-42; Lazennec, G. et al. (1999) Mol. Cell. Endocrinol. 149, 93- 
105; Lazennec, G., et al. (1999) Mol. Endocrinol. 13, 969-980). These findings concur the 
prediction that establishment of functional estrogen-ER signaling could provide an effective 

25 treatment strategy for ER-negative breast cancers. 

58. Although, as in de novo endocrine resistance, aberrant growth factor signaling 
pathways play a critical role in the initiation and progression of acquired endocrine 
resistance, gain-of-function of ER is coincident with the development of resistance. This 
implies that additional therapeutic benefits could be achieved by reducing ER function. 

30 Strategies to further ablate the intracellular activity of ER by employing anti-sense 

technologies to inhibit ER expression and ER-dominant negative mutants to interfere with 
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protein function constitute paradigms for the development of novel strategies for breast 
cancer treatments. 

(1) Targeting ER 

59. Dominant-negative receptors are structurally altered forms that are deficient in 
5 transcriptional activation of responsive genes, but they retain the ability to suppress the 

activity of the wild-type receptor (Herskowitz, I. (1987) Nature 329, 219-222). Studies 
exploring the functional inactivation of ERa have identified and generated ERa mutants 
with dominant-negative phenotypes. These include truncated or point mutant ERa variants 
in which substitution or deletion mutations in the carboxyl terminal region severely affect 

10 the estrogen-dependent AF-2 function (Ince, B. A. et al. (1993) J. Biol. Chem. 268, 14026- 
14032). Using adenovirus-mediated gene delivery systems, further studies found that 
expression of dominant-negative or anti-sense ERa effectively suppresses proliferation and 
induces apoptosis in both ER-positive breast cancer cells (Lee, E. J. et al. (2001) Mol. Med. 
7, 773-782; Lazennec, G., et al. (1999) Mol. Endocrinol. 13, 969-980; Madden, T. A. et al. 

15 (2000) Eur. J. Cancer 36 Suppl 4, S34-35) and xenograft models (Lee, E. J. et al. (2001) 
Mol. Med. 7, 773-782). Dominant negative interference is also the basis for the generation 
of novel ER proteins. Genetic integration of either the Krupple-associated box (KRAB) (de 
Haan, G. et al. (2000) J. Biol. Chem. 275, 13493-13501) or the repression domain of NCoR 
(Chien, P. Y. et al. (1999) Mol. Endocrinol. 13, 2122-2136) as single or multiple copies to 

20 WT or mutant ERa produced dominant negative ER species that effectively repressed both 
basal and estrogen-ER mediated transcription in situ. Similarly, a chimeric receptor 
composed of the DBD of ERa for binding to ERE, the KRAB repressor domain to silence 
target genes and a truncated LBD of progesterone receptor to respond only to the synthetic 
ligand, RU486, effectively blocks estrogen-ER mediated transcription under the control of 

25 RU486 (Ma, Z. Q. et al. (1999) J. Steroid Biochem. Mol. Biol. 69, 155-163). 

60. These approaches are confirmatory for the prediction that inhibition of ER 
synthesis/function could provide an alternative/complementary modality for targeted therapy 
of breast cancers. Dominant negative interference of ERa is suggested to occur at multiple 
levels of the signaling pathway that includes the formation of inactive heterodimers, 

30 competition for ERE binding, and specific transcriptional silencing. However, determining 
target gene responses during co-synthesis of different ER species is difficult. This is 
because dimers formed in the presence of more than one ER species are a heterogeneous 



— 23 — 



WO 2005/033291 



PCT/US2004/032561 



population of homo- or hetero-dimers. Moreover, heterodimerization also provides novel 
characteristics to ERby combining distinct functions of the contributing partners (Tremblay, 
G. B. (1999) Molecular & Cellular Biology 19, 1919-1927; Muyan, M. et al. (2001) Mol'. 
Cell. Endocrinol. 182, 249-263). This combinatorial diversity, hence heterogeneity, of ERs 
5 is an obstacle for the use of variants as effective means of regulating estrogen signaling. 

61. Disclosed herein is a genetic conjugation approach to circumvent the pivotal 
dimerization step in receptor action (Muyan, M. et al. (2001) Mol. Cell. Endocrinol. 182, 
249-263). Homofusion ERa was engineered as a prototype by genetically conjugating two 
ER monomers into a covalently fused single-chain protein to obtain a homogeneous 

10 population. Disclosed herein the single-chain ERa is in a "closed, dimer-like" 

configuration through an znfra-molecular interaction, in contrast to WT-ERa that is a 
homodimer by an iwter-molecular assembly (Muyan, M. et al. (2001) Mol. Cell. Endocrinol. 
182, 249-263). The genetic conjugation approach precludes the limitation of monomer 
association into biologically active dimers and the dissociation of dimers into inactive 

15 monomer induced by destabilizing mutations. This method offered an opportunity to 

expand the understanding of structure- function relationships of not only ER homodimers but 
also heterodimers with biological activity that is difficult to assess due to presence of the 
homodimers of contributing partners. Permitting functional analysis of unique symmetrical 
or asymmetrical mutations that simulate variant homo- and heterodimers, the genetic 

20 conjugation can also allow elucidation of the mechanisms of action of specific homo- and 
hetero-dimers of ER variants. This, in turn, can reveal their contribution to tumor 
progression and acquisition of resistance to antiestrogen treatments. To examine the effects 
of a dominant-negative receptor on transcription induced by WT-ERa, transcriptionally 
defective mutants in which point mutations obliterate the AF-2 function were generated. 

25 The WT ERa (ERa) and/or the AF-2 mutant (AF2) were then genetically fused to generate 
single chain receptors that emulate the WT homodimer (a-a), heterodimer (ERa-AF2 and 
AF2-ERa) and AF-2 homodimer (AF2-AF2). When co-expressed with WT-ERa, only the 
single-chain variant deficient in both AF-2 functions repressed the reporter activity more 
effectively than the ERa variant bearing the same AF-2 mutation, which has the ability to 

30 also heterodimerize with WT-ERa (Muyan, M. et al. (2001) Mol. Cell. Endocrinol. 182, 

249-263). This single-chain ER approach provides a model for generating fusion constructs 
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to inhibit tumor growth by either deactivation of dominant-positive or activation of 
dominant-negative pathways. 

(2) Targeting EREs. 
62. Dominant negative ER variant or single-chain ER strategies to attenuate or 
5 completely prevent ER functions can provide effective approaches to combat breast cancer. 
Since the interaction of the estrogen-ER complex with ERE of each responsive gene 
constitutes a critical genomic signaling pathway, targeted regulation (activation/repression) 
of ERE-driven genes disclosed herein provides a compelling approach in breast cancer 
treatments. 

10 63. The disclosed ERE binding transregulators, on the othe* hand, potently regulate 

the expression of ERE-containing genes independent of ligand, dimerization, ER-subtypes, 
promoter- and cell-context without altering ERE-independent genomic signaling. 
Furthermore, transregulators, just as the estrogen-ERct complex, differentially altered cell 
cycle progression in cells derived from breast cancer. Disclosed herein ERE binding 

15 activators repress the cell cycle progression in ERa-negative MDA-MB-23 1 cells, while 
they enhance cell cycle progression in ERa-positive MCF-7 cells (Huang, J. et al. (2004) 
Mol. Cell. Endocrinol. 218, 65-78). 

64. Emphasizing the importance of ERE-containing genes in the regulation of cell 
proliferation, these findings indicate that EBAs can be used for identification and 

20 subsequent elucidation of the roles of ERE-containing genes in physiology and 

pathophysiology of breast tissue. This provides a better understanding of the differences in 
the mechanisms of genomic responses manifested as phenotypic diversity among breast 
tumors. This understanding can aid strategies for cancer treatment. The suppression of 
ERE-containing genes, for example, by the designer ERE-binding repressors can constitute 

25 a therapeutic approach for ER expressing endocrine sensitive and acquired endocrine 

resistant breast tumors. Furthermore, the activation of the ERE-containing gene network by 
EBAs could provide an effective strategy for the treatment of ER-negative neoplasms. This 
can serve as a basis to alter the growth of breast tumors independent of ER status. 
Furthermore, in contrast to antiestrogen therapy, such constructs could be targeted to 

30 specific tissues of interest by gene therapy approaches. 
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(3) Establishment of an effective gene delivery system. 

65. Recombinant adenoviruses are versatile tools for gene delivery and expression 
due to wide cellular tropisms and prolonged transgene expression profiles that are 
independent of host cell division. The most commonly used adenoviral vector is the human 

5 adenovirus serotype 5 (Ad5), which is rendered replication defective by the deletion of the 
essential genes, El and E3. Ad5 has been adapted into an efficient vector for in situ, ex vivo 
and in vivo gene transfer (Kozarsky, K. F. et al. (1993) Curr. Opin. Genet. Dev. 3, 499-503; 
Channon, K. M. et al. (1997) QJM 90, 105-109; Young, L. S. et al. (2001) Gut 48, 733- 
736). Recent studies further indicated that Ad5 could successfully be used for gene transfer 

10 into breast cancer cell lines with high efficiency (Lee, E. J. et al. (2001) Mol. Med. 7, 773- 
782; Lazennec, G. et al. (1999) Mol. Cell. Endocrinol. 149, 93-105; Lazennec, G., et al. 
(1999) Mol. Endocrinol. 13, 969-980; Lucas, A. et al. (2003) Biochem. Biophys. Res. 
Commun. 309, 101 1-1016). One major advantages of recombinant adenovirus compared to 
other gene delivery systems is the fact that virus remains epichromosomal, that is not 

15 incorporated into the host genome, in cells. This prevents inadvertent activation or 
inactivation of the host genome by the adenovirus genes. 

66. Disclosed herein is the use of the AdEasy XL system (Stratagene, La Jolla, CA) 
to generate recombinant adenovirus as our gene delivery vehicle. The AdEasy XL system 
significantly shortens the time required producing the recombinant adenoviral plasmid by 

20 homologous recombination in E.coli. The cDNA for p-galactosidase, CDC, ERa, an EBA 
or EBR was cloned into the shuttle vector, pShuttle-CMV. In this vector, the strong 
cytomegalovirus (CMV) promoter drives the expression of the gene of interest. The 
resultant vector was transformed into E.coli strain BJ5183-pAD-l pre-transformed with the 
adenoviral backbone plasmid p AdEasy- 1 for homologous recombination. Bacterial colonies 

25 containing the recombinant adenoviral plasmid were selected against kanamycin, 

propagated and purified by a DNA Maxi prep kit (Qiagen). The recombinant vector was 
then linearized and transiently transfected into AD-293 cells for virus production. AD-293 
cslls~are derived from human embryonic kidney cells, HEK 293, which are transformed by 
the sheared Ad5 DNA. AD-293 cells produce adenovirus in trans allowing the production 

30 of infectious virus particles when cells are transfected with pAdEasy-1 vector. 
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d) Cell models for breast cancer ph en o types 

67. Establishment of an effective gene delivery system to breast cancer cells can 
make our approach more amenable for examining the effects of altered expression of ERE- 
containing genes. The ER-negative MDA-MB-231 cell line (American Type Culture 

5 Collection, ATCC, Rockville, MD) can be used as a model for de novo endocrine resistant 
breast cancer phenotype. The use of MCF-7 cells (ATCC) that endogenously synthesize 
ERs, primarily ERa, can represent the model cell line emulating endocrine sensitive breast 
neoplasms. 

68. The MCF7/LCC sublines can be used as experimental cell models. The 

10 generation of endocrine resistant sublines through in vitro selection of the parent MCF-7 

cell line against antiestrogens provided a means to study acquired resistance. These sublines 
retain ER expression and show various patterns of resistance and cross-resistance to 
antiestrogens. LCC variants were established from an MCF-7 phenotype that grew 
independent of E2 in ovariectomized nude mice (Clarke, R. et al. (1989) Proc. Natl. Acad. 

15 Sci. USA 86, 3649-3653). Subsequent in vitro propagation of cells from these tumors led to 
the generation of an MCF7/LCC1 phenotype that retains ER expression and E2-independent 
growth but are responsive to growth inhibition by Type I and II antiestrogens (Brunner, N. et 
aL (1993) Cancer Res. 53, 283-290). Although transplanted LCC1 cells efficiently produce 
tumors in the absence of E2, tumor growth is accelerated in the presence of E2. Therefore, 

20 LCC1 phenotype retains some degree of E2 responsiveness in vivo (Brunner, N. et al. (1993) 
Cancer Res. 53, 283-290). Further selection of MCF7/LCC1 cells with graded 
concentrations of TAM resulted in a TAM-resistant MCF7/LCC2 variant. This subline also 
showed E2-independent growth in vitro and in vivo but were sensitive to growth inhibition 
by ICI (Brunner, N. et al. (1993) Cancer Res. 53, 3229-3232). It is this subline that provides 

25 a model for tumors that initially responded to Type I antiestrogens, but then acquired 
resistance to these compounds (Brunner, N. et al. (1997) Cancer Res. 57, 3486-3493). 
Subsequent selection of MCF7/LCC1 cells against increasing concentrations of ICI resulted 
in the phenotype of MCF7/LCC9, which is ER-positive and E2 independent, and ICI and 
TAM-crossresistant (Brunner, N. et al. (1997) Cancer Res. 57, 3486-3493). This subtype is 

30 a model for a full endocrine resistant breast cancer. LCC sublines are reported to be more 
invasive and metastatic in vivo than parental MCF-7 cells (Clarke, R. et al. (2001) 
Pharmacol. Rev. 53, 25-71). It appears that LCC phenotypes up-regulate some E2 
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responsive genes, exemplified by the pS2 and PR genes, with concurrent loss in 
responsiveness to the ligand-ER complex (Brunner, N. et al. (1993) Cancer Res. 53, 283- 
290; Gu, Z. et al. (2002) Cancer Res. 62, 3428-3437). MCF7/LCC 1 , /LCC2 and /LCC9 cell 
lines were obtained and are described in Clarke R. et al. (2001) Pharm. Rev. 53:25-71. 
5 C. Compositions 

69. Disclosed are the components to be used to prepare the disclosed compositions 
as well as the compositions themselves to be used within the methods disclosed herein. 
These and other materials are disclosed herein, and it is understood that when combinations, 
subsets, interactions, groups, etc. of these materials are disclosed that while specific 

10 reference of each various individual and collective combinations and permutation of these 
compounds may not be explicitly disclosed, each is specifically contemplated and described 
herein. For example, if a particular (VP16) 2 -CDC-(VP16) 2 is disclosed and discussed and a 
number of modifications that can be made to a number of molecules including the (VP16) 2 - 
CDC-(VP16) 2 are discussed, specifically contemplated is each and every combination and 

15 permutation of (VP16) 2 -CDC-(VP16) 2 and the modifications that are possible unless 
specifically indicated to the contrary. Thus, if a class of molecules A, B, and C are 
disclosed as well as a class of molecules D, E, and F and an example of a combination 
molecule, A-D is disclosed, then even if each is not individually recited each is individually 
and collectively contemplated meaning combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, 

20 and C-F are considered disclosed. Likewise, any subset or combination of these is also 
disclosed. Thus, for example, the sub-group of A-E, B-F, and C-E would be considered 
disclosed. This concept applies to all aspects of this application including, but not limited 
to, steps in methods of making and using the disclosed compositions. Thus, if there are a 
variety of additional steps that can be performed it is understood that each of these 

25 additional steps can be performed with any specific embodiment or combination of 
embodiments of the disclosed methods. 

70. Disclosed herein are transactivators that specifically target ERE to regulate the 
expression of ERE -containing genes. The modular nature of ER permitted the design of a 
monomelic ERE binding module by genetically joining two DNA binding domains with the 

30 hinge domain. Discloed herein is the integration into this module of strong activation 
domams-from other transcription factors to generate constitutively active ERE binding 
activators (EBAs). EBAs potently induced the expression of only ERE-containing genes 
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independent of ligand, dimerization, ER subtypes, promoter- and cell-context. Moreover, 
EBAs, as E2-ERa, differentially altered cell cycle progression in cells derived from breast 
cancer. 

1. Linked modulators 

5 71 . As discussed herein, linked modulators are engineered molecules that bind DNA 

at specific sites. Linked modulators can act as transcription modulators, by for example, 
activating transcription or repressing transcription. Linked modulators comprise at least two 
DNA binding domains as discussed herein, which are covalently linked. The covalent 
linkage occurs via a linker, which in certain embodiments can be a hinge as discussed 

10 herein. The linked modulators can include a number of other domains, such as activation 
domains or repressor domains, or other interaction domains, or nuclear localization signals, 
or targeting domains for example degradation, or regulation domains, such as domains that 
regulate them depending on the type of cell the linked modulator is in. 

72. One way of representing linked modulators is by using the formula W-Z-X, 

15 wherein W is a first DNA binding domain (DBD), X is a second DNA binding domain, and 
Z is a linker. The linked modulator can further comprise an activation domain or repressor 
domain designated Y. Disclosed are any permutations of these combinations. It is 
understood that the two DNA binding domains can be the same or similar or different 
domains. In addition, as disclosed herein there can be more than one activation or repressor 

20 domain. These can be designated as Y,, Y 2 , Y 3 , Y 4 , Y 5 , Y 6 , and so forth. Thus, the linked 
modulators, for example, could be Y|,-W-Z-X or Yi-Y 2 - W-Z-X- Y 3 , and so forth. It is 
understood that when there is more than one activation or repressor domain, the domains 
can be the same or different domains, for example, two VP 16 domains, or one VP 16 and 
one p65 domain. It is understood that the activation domains or repressor domains can be 

25 fused to the linked modulators at either the amino terminal end or carboxyl end or both of 
the linked modulator. Furthermore the activation or repressor domains themselves can be 
linked in either an amino terminal or carboxyl orientation. Furthermore, there can be any 
number of activation or repressor domains linked in any fashion. For example, there can be 
one, two, three, four, five, six, seven, eight, nine, ten, or more domains linked on either or 

30 both the amino and carboxyl sides of the linked modulator. 
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73. It is also understood that proteins have an amino and carboxyl end, and unless 
indicated otherwise, they are written left to right, amino to carboxyl. Thus, W-X-Y unless 
otherwise indicated would be "amino terminal-W-X-Y-carboxyl terminal.'* 

74. In certain embodiments of the linked modulators which are based on the ER, it is 
5 understood that herein the DNA binding domain is referred to as C and the linker or hinge 

domain is referred to as D. Thus, for ER based linked modulators, a linked modulator could 
be CDC. 

75. It is also understood that more than one linked modulator can be used. In these 
embodiments, there would be more than one element that the linked modulator would 

10 recognize, on the target DNA. Thus, in this type of embodiment, there would be at least 
four DNA binding sites, that would be bound by the at least four DNA binding domains, 
two in each linked modulator, if there are two linked modulators being used. As disclosed 
herein, the effect of having more than one linked modulator can be synergistic. 

76. Furthermore, a linked modulator could have more than two DNA binding 

15 domains, such as three or four DNA binding domains. These types of modulators would 

also have additional linker regions, and typically would have one less linker than the number 
of DNA binding domains. Thus, linked modulators, such as W-Z-X-Z-W-Z-X, or any 
combination are disclosed. It is understood that the DNA binding domains can be the same 
or different when there are multiple domains. 

20 2. DNA binding domains 

77. The DNA binding domain (DBD) is responsible for targeting the receptors to 
their hormone response elements (HRE). The DNA binding domain binds as a dimer with 
each monomer recognizing specific contacts in the major groove of the DNA at each half- 
site. These contacts allow the dimer to read the sequence, spacing and orientation of the 

25 half-sites within its response element, and thus discriminate between sequences. Such 
domains are well-known in the art and can be acquired from many different proteins or 
enzymes. It is understood that the disclosed compositions can be practiced with any such 
domain including those not specifically discussed herein. Thus, in certain embodiments 
compositions comprising a first and a second DNA binding domain, wherein the first DNA 

30 binding domain comprises a DNA binding domain of a transcription factor, are disclosed. In 
certain embodiments the DNA binding domain can be a DNA binding domain from a 
steroid receptor. For example, the DNA binding domain can be from, for example, 
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Glucocorticoid receptor (GR), Androgen receptor (AR), Progesterone receptor (PR), 
Thyroid hormone receptor (TR), Retinoid X receptor (RXR), all-trans Retinoic Acid 
receptor (RARa), and Vitamin D receptor (VDR). Representative examples of the sequence 
of these proteins can be found in SEQ ED NOs: 53, 57, 61, 65, 69, 73, and 77 respectively. 
5 Also disclosed are compositions wherein the DBD is from Estrogen Receptor ERa or 
ERp which are set forth in SEQ ID NO: 45 and 49 respectively. 

78. The DNA binding regions of hormone receptors contain two zinc finger-like 
modules that fold to form a single functional domain. The protein dimer contacts 
nucleotides in adjacent major grooves on one face of the DNA helix containing a response 

10 element. Distinct residues in a region of the first zinc-finger module of the C domain, P- 
box, determine the DNA binding specificity (sequence discrimination) through a protein- 
DNA interaction. Whereas residues in the second zinc-finger-like module, the D-box, are 
involved in the discrimination of half-site spacing through a protein-protein interaction 
(dimerization) that influence the spacing and relative orientation of the two DNA associated 

15 monomers (Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. (1995) 
Cell 83, 835-839). These structural constraints prevent each hormone receptor from binding 
promiscuously to REs by the recognition of the central spacing, sequence and arrangement 
of half-sites (Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. 
(1995) Cell 83, 835-839). 

20 79. The compositions can comprise a second DBD. Therefore, disclosed are 

compositions, wherein the second DNA binding domain comprises a DNA binding domain 
of a transcription factor. Specifically disclosed are compositions comprising a second DBD 
from a transcription factor, wherein the transcription factor can be selected from the group 
of transcription factors consisting of Estrogen receptor (ER), Glucocorticoid receptor (GR), 

25 Androgen receptor (AR), Progesterone receptor (PR), Thyroid receptor (TR), Retinoid X 
receptor (RXR), all-trans Retinoic Acid receptor (RAR), and Vitamin D receptor (VDR). 
Also disclosed are compositions wherein the DBD is from ERa or ERp. 

80. The DNA binding domain of the ER can be found at for example SEQ ID NO: 
23 (ERa) which is amino acids 1 82-25 1 of SEQ ID NO:45 and SEQ ID NO: 25 (ERP) 

30 which is amino acids 144-214 of SEQ ID NO:49, of the GR at SEQ ED NO: 27 which is 

amino acids 417-491 of SEQ ID NO:53, of the AR at SEQ ID NO: 29 which is amino acids 
370-442 of SEQ ID NO:57, of the PR at SEQ ID NO: 31 which is amino acids 562-635 of 
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SEQ EDNO:61, of the TR at SEQ ID NO: 33 which is amino acids 98-170 of SEQ ID 
NO:65, of the RXR at SEQ ID NO: 35 which is amino acids 131-201 of SEQ ID NO:69, of 
the RAR at SEQ ID NO: 37 which is amino acids 84-156 of SEQ ID NO:73, and of the 
VDR at SEQ ID NO: 39 which is amino acids 22-92 of SEQ ID NO:77. 

5 81 . It is understood that the DNA binding domains based on the disclosed steroid 

receptor proteins can vary, and still maintain their specificity and function. For example, 
Glucocorticoid (GR), Progesterone (PR) and androgen (AR) receptors recognize hormone 
response elements (REs) having half-sites, 5'-AGAACA-3', similar to that of the ERE. 
Thyroid (TR), retinoid X (RXR), all-trans retinoic acid (RAR) and vitamin D (VD) 

10 receptors, on the other hand, interact with REs with half-sites, 5 , -AGGTCA-3 t , identical to 
that of ERE. These sites however are arranged in direct (RXRE, RARE, VDRE, TRE) or 
inverted (GR, PR, AR RE) repeats with various spacing. ERE half-sites without spacing 
also produce a palindromic TRE. The DNA binding regions of these hormone receptors 
contains two zinc finger-like modules that fold to form a single functional domain. The 

15 protein dimer contacts nucleotides in adjacent major grooves on one face of the DNA helix 
containing a response element. Distinct residues in a region of the first zinc-finger module 
of the C domain, P-box, determine the DNA binding specificity (sequence discrimination) 
through a protein-DNA interaction. The P box of ERa and ER(3 contains EGckA, ie. Glu- 
Gly-Cys-Lys-Ala,; whereas the P box of PR, for example,contains GSckV, ie. Glu-Ser-Cys- 

20 Lys-Val amino acid sequence. These amino acid composition is critical to the ability of a 
DBD to discriminate between the PRE/GRE and the ERE. Any mutational change in the P 
Box sequence of a steroid/thyroid hormone receptor disturbs the specificity of recognition of 
a DNA response element. When the first amino acid Gly, G in the GR-DBD, for example, 
was mutated to Trp, W, results in promiscuous binding to many response elements including 

25 the PRE/GRE and the ERE. Residues in the second zinc-finger-like module, the D-box, are 
involved in the discrimination of half-site spacing through a protein-protein interaction 
(dimerization) that influence the spacing and relative orientation of the two DNA associated 
monomers. These structural constraints prevent each hormone receptor from binding 
promiscuously to REs by the recognition of the central spacing, sequence and arrangement 

30 of half-sites. 
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3. Activation domains 

82. The activation domains that can be attached and used as disclosed herein can in 
certain embodiments be any activation domain. By activation domain is meant a protein 
domain that when brought within proximity of a particular transcription start site, will cause 

5 transcription from that site to be increased relative to transcription from that site in the 
absence of the activation domain. Such modulation can represent an activation of gene 
expression. Herein, "activation of expression" is understood to mean any increase in the 
transcription expression of a particular gene above the native amount or above a control 
amount. For example, if a particular composition comprising a transcription modulator or 

10 linked modulator is used to change gene expression and the modulator is an activation 

domain, then "activation" would be any increase in the gene expression with the activation 
domain over the gene expression of a control without the transcriptional modulator. 

83. Activation domains can be amphipathic helices (containing polar and nonpolar 
regions) or aliphatic regions of peptide (containing uncharged, nonpolar hydrocarbon 

15 residues). Activation domains can also be regions of peptide that have a strong acidic nature 
because of the presence of acidic residues, which are called acidic activation domains. One 
example of an activation domain is the strong AD of the herpes simplex virion protein, 
VP16. This is an acidic activation domain. Another example of an AD is the p65 subunit 
of the nuclear factor kB, NFkB. An example of a VP1 6 AD can be found in SEQ ID NO: 

20 42, and an example of a p65 AD can be found in SEQ ID NO:41 . Other examples of 

activation domains include, but are not limited to the human heat shock factor HSF-1 and 
GAL4. It is understood and and herein contemplated that activation domains can comprise 
any activation domain known or unknown in the art. Thus, one embodiment is an acidic 
activation domain. Another embodiment is an aliphatic activation domain. Also disclosed 

25 is an amphipathic activation domain. It is understood that more than one activation domain 
may be present. Therfore, specifically contemplated are multiple activation domains. For 
example, specifically disclosed are compositions comprising one, two, three, four, six, eight, 
and ten activation domains. It is understood that activation domains from different sources 
can be used to obtain optimal modulation of transcription. Therefore, specifically disclosed 

30 are compositions comprising an activation domain, wherein the activation domain 
comprises one VP 16 activation domain and one p65 activation domain. Thus, it is 
understood that the disclosed compositions can comprise one amphipathic and one acidic 
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activation domain. Also discosed are compositions comprising multiple acidic and/or 
multiple amphipathic activation domains. Thus specifically contemplated and disclosed are 
compositions, wherein the activation domain comprises two VP 16 activation domains and 
two p65 activation domains. 
5 84. The activation domains can be characterized by how much they increase 

transcription. Disclosed are activation domains that, when used with linked modulators, 
increase transcription at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 50, 100, 150, 200, 500, 
or 1000 fold relative to no linked modulator and activation domain. Furthermore, in certain 
situations, activation can be considered additive, meaning, for example, the activity of one 

10 domain is X, the activity of two domains is 2X, the activity of three domains is 3X, and the 
activity of four domains is 4X, and so forth. Another type of activity is synergistic activity, 
which means that the total effect of the activation is more than what the additive effect 
would be. For example, if activity of one domain is X, and the activity of four domains is, 
for example, 10X (more than 4X), the activity would be synergistic. 

15 4. Repressor domains 

85. Similarly, the linked modulators can comprise repressor domains of gene 
expression. By repressor domain is meant a protein domain that when brought within 
proximity of a particular transcription start site, will cause transcription from that site to be 
decreased relative to transcription from that site in the absence of the repressor domain. 

20 Such modulation can represent an repression of gene expression. Herein "repression of 
expression" refers to any decrease in the level of gene expression below the native or a 
control amount. For example, if a particular composition comprising a transcription 
modulator is used to change gene expression and the modulator is a repressor domain, then 
"repression" would be any decrease in the gene expression with the repressor domain below 

25 the gene expression of a control without the transcriptional modulator. The repressor 
domains can be characterized by how much they decrease transcription. Disclosed are 
repressor domains that, when used with linked modulators, decrease transcription at least 2, 
3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 50, 100, 150, 200, 500, or 1000 fold relative to no 
linked modulator and repressor domain. 

30 86. Examples of repressor domains include, but are not limited to Kriipple associated 

box (KRAB) and Sin3 interaction domain (SID). Other examples of repressor domains 
include, but are not limited to MAD, DAD, and ChromoShadow. It is also understood that 
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more than one transcriptional modulator or repressor domain can be used in the 
compositions described herein. Thus specifically disclosed are compositions comprising a 
repressor domain, wherein the repressor domain comprises one, two, three, four, five, six, 
seven, eight, nine, ten or more activation domains, such as a KRAB repressor domain or a 

5 SID repressor domain. It is understood that activation domains from different sources can 
be used to obtain optimal modulation of transcription. Therefore, specifically disclosed are 
compositions comprising a repressor domain, wherein the repressor domain comprises one 
KRAB repressor domain and one SID repressor domain. Also disclosed are compositions, 
wherein the repressor domain comprises two KRAB repressor domains and two SID 

10 repressor domains. An example of a KRAB repressor domain can be found in SEQ ID 
NO:44. An example of a SID repressor domain can be found in SEQ ID NO:43. 

87. The Kriippel associated box (KRAB) domain is an amino acid sequence motif 
found at the amino-terminus of nearly one-third of all Kruppel/TFIDA-type C2H2 zinc 
finger proteins (Bellefroid EJ. et al. (1991) Proc Natl Acad Sci USA 88:3608-12). This 

15 highly conserved domain displays potent, DNA-binding dependent repression of 

transcription that requires the KRAB-associated protein- 1 (KAP-1) as the co-repressor 
(Margolin JF et al. (1994) Proc Natl Acad Sci USA 91:4509-13; Friedman JR et al. (1996) 
Genes Dev 10:2067-78; Moosmann P. et al. (1996) Nucleic Acids Res 24:4859-67; 

87. Moosmann P. et al. (1997) Biol Chem 378:669-77). The KAP-1 co-repressor in 

20 turn recruits the m/cleosome remodeling histone rfeacetylase complex (NuRD). The NuRD 
complex is composed of three functionally distinct proteins, Mi-2, MTA-2 and MBD3 that 
assemble through multiple mter-molecular interactions to form the complex. Mi-2 
possesses an ATP-dependent chromatin remodeling activity and anchors the NuRD complex 
to the core HDAC1 and HDAC2 complexes that deacetylate histones. The methyl-DNA- 

25 binding protein MBD3, on the other hand, appears to mediate the ability of NuRD to repress 
the transcription in a methylati on-dependent manner. The subsequent interaction of the 
NuRD-HADC complex with HP1, a family of nonhistone heterochromatin-associated 
proteins, results in the formation of a heterochromatin-like complex, leading to gene 
silencing. All members of the KRAB domains repress transcription when tethered to DNA. 

30 Fusions of the KRAB domain derived from KOX1/ZNF1 0 zinc finger protein to the DNA 
binding domains of both LexA and Gal4 factors, for example, effectively silence the 
transcription of reporter genes in transfected cells regardless of the distance of the cognate 
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binding element to the transcription initiation site. In addition to the recruitment of co- 
mpressor complexes by the fusion proteins to the promoter, specific inhibition of some 
component(s) of RNA polymerase II and m transcription by KRAB appears also involved in 
transcription suppression (Moosmann P. et al. (1996) Nucleic Acids Res 24:4859-67; 
5 Moosmann P. et al. (1997) Biol Chem 378:669-77). 

88. Transcriptional repression by the basic helix-loop-helix zipper protein Madl 
requires DNA binding as a ternary complex with its heterodimer partner Max and the 
mammalian co-repressor protein Sin3. Similarly, the repression of thyroid hormone 
responsive genes is mediated by the recruitment of Sin3 repressor complex by the 

10 unliganded TR-corepressor N-CoR/SMRT complexes. The Sin3 complex contains two 

additional proteins, SAP1 8 and SAP30. These proteins are integral components of the Sin3 
complex. They are involved in the facilitation or stabilization of co-repressor interactions 
with Sin3 and HDACs. The interaction between Madl and Sin3 is mediated by a 35 amino- 
acid region in the amino-teiminus of Madl (Sin3 interaction domain, or SID) (Kasten MM. 

15 et al. (1996) Mol Cell Biol 16:4215-21, Ayer DE et al. (1996) Mol Cell Biol 16:5772-81. 

89. ). The tethering of SID, just as KRAB, to a DBD of a transcription factor 
suppresses gene transcription mediated by the response element to which the parent 
transcription factor binds. As NuRD complex, Sin3 or SID-mediated transcriptional 
repression involves HDAC1 and HDAC2 recruitment to the promoter. Thus, although a 

20 distinct set of transcriptional repressors recruits different co-repressor complexes to 

establish a repressor, the recruitment of HADC1 and HDAC2 by NuRD or Sin3 complex is 
critical for the active gene repression (Kasten MM. et al. (1996) Mol Cell Biol 16:4215-21, 
Ayer DE et al. (1996) Mol Cell Biol 16:5772-81). In short, genetic conjugation of a single or 
multiple copies of SID, or KRAB, to either terminus or both termini of CDC provides a 

25 potent ERE-binding transrepressor. This is because SID or KRAB recruits correpressor 
proteins to the gene the CDC binds. This complex then recruits HDACs to lead to 
transcription repression. 

90. It is understood that it is possible for the compositions to a DBD linked to a 
KRAB repressor, wherein the DBD is from c-Myb (MybDBD SEQ ID NO: 82). It is an 

30 embodiment that the composition excludes a DBD from c-Myb or KRAB or linkages 

thereof. For example, one embodiment excludes MybDBD-KRAB and one embodiment 
does not consist of MybDBD-KRAB (Nawrath, M. et al. (2000) Cancer Gene Ther 7(6), 
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963-972). Nawrath et al. is herein incorporated by reference in its entirety for its teachings 
of MybDBD-KRAB. 

91. It is also understood that in certain embodiments a linked modulator, comprising 
two DNA binding domains connected by a linker, can act as a repressor. In these 

5 embodiments, no repressor domain is needed to for the linked modulator to function as a 
repressor. This is consistent with the linked modulator competing with an endogenous 
transactivator. 

92. Furthermore, it is an embodiment that one or more of the components of the 
composition or methods disclosed herein can be specifically excluded. 

10 5. Linker and Hinge regions 

93. The DNA binding domains of the linked modulator can be any linker. Typically 
they will be a peptide linker. Often they will be a peptide linker that is or has homology to a 
hinge region of a steroid receptor, such as the ER. Hinge domains are well known in the art 
of receptors and hormone response elements. Hinge domains (also referred to herein as 

15 linker domains) are herein contemplated to mean a flexible region comprising a nuclear 
localization signal which allows for transactivators to localize to the nucleus and functions 
to link two separate DNA binding domains yet providing flexibility to allow efficient 
interaction with a response element. The importance of flexibility of the linker domain is 
evident via the poor binding of DBD linked by the hinge domain of SEQ ID NOs: 83 and 84 

20 (Kuntz, M. A. et al. (1997) J Biol Chem 272, 27949-279456). The lack of flexibility in the 
linker domain, did not allow for efficient binding of the DBD to an ERE. The resulting 
binding produced a dissociation constant (K D ) of 38nM. More flexible linkers allow for 
stronger binding. For example, two ERa C domains linked by a ERa D domain (CDC) has 
a K D = 1.49 ± 0.1 6nM for binding to ERE. Thus, one embodiment is a linker, wherein the 

25 linker does not comprise SEQ ID NO: 83 or 84. Linker domains can be derived from 
receptors where the hinge domains link domains in the receptor. For example, it is 
understood that the hinge domain can comprise an ER D-domain. The D-domain can 
comprise any one of SEQ ID NOs:24, 26, 28, 30, 32, 34, 36, 38, or 40. It is understood that 
the hinge domain can comprise the ER D-domain of ERa or ER{3, and can comprise SEQ 

30 ID No 24 or SEQ ED NO:26, respectively. It is also understood that hinge domains from 
other hormone receptors can be used with equal effectiveness. 
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94. The hinge domain of the GR at SEQ ID NO:28, of the AR at SEQ ID NO:30, of 
the PR at SEQ ID NO:32, of the TR at SEQ ED NO:34, of the RXR at SEQ ID NO:36, of the 
RAR at SEQ ID NO:38, and of the VDR at SEQ ID NO:40. 
6. Examples of linked modulators 
5 95. One example of linked modulators are proteins based on ERE binding proteins. 

ERE binding proteins were engineered as potent activators of ERE-containing genes 
independent of ligand, ER subtypes, dimerization, promoter- and cell-type. Proteins with 
various configurations were initially generated to obtain an ERE binding module that 
interacts efficiently with the consensus ERE. As discussed herein, ER recognizes an ERE as 

10 a dimer with interaction primarily mediated by a dimerization interface located in the ligand 
binding domain and through a weak interaction surface in the C domain (Parker, M. G. 
(1998) Biochem Soc Symp 63, 45-50, Hall, J. M. et al. (2001) J Biol Chem 276, 36869- 
36872). Each C domain of the ER dimer makes analogous contacts with one of the inverted 
motifs, resulting in a rotationally symmetric structure (Glass, C. K. (1994) Endocr Rev 15, 

15 391-407; Luisi, B. F.et al. (1994) Vitam Horm 49, 1-47). Although the isolated C domain 
can interact with a half-site of the consensus ERE as a monomer, albeit with a substantially 
lower DNA binding affinity than that of the parent receptor, it requires ERE-dependent 
dimerization for the formation of a stable C-ERE complex (Schwabe, J. W. et al. (1993) 
Cell 75, 567-578; Nardulli, A. M. et al. (1991 J Biol Chem 266, 24070-24076; Kuntz, M. A. 

20 et al. (1997) J Biol Chem 272, 27949-279456). ERs share 97% amino-acid identity in their 
DBDs (Mosselman, S.et al. (1996) FEBS Letters 392, 49-53). This is reflected in their 
abilities to bind various ERE sequences with similar specificity and affinity by interacting 
with the same nucleosides (Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). Thus, the C 
domain of ERa was chosen as the template protein in the engineering of ER subtype- 

25 independent ERE binders (SEQ ID NO:23). The engineered proteins, linked modulators 

consisted of two C domains of ERa co-joined without (CC and CCD) or with the D domain 
as a Tinker (CDC) by the genetic conjugation approach (Fig 1A) (Muyan, M. et al. (2001) 
Mol'Cell'Ertdocrinol 182, 249-263). The choice of the D domain as a linker was based on 
_theJGollowing observations. First, although the isolated C domain is the minimal region for 

30 ERE binding, the immediate amino-terminal residues of the D domain contribute 

significantly to the DNA binding affinity by creating a stable interaction between the half- 
site-bound monomers (Green, S. et al. (1986) Cold Spring Harbor Symposia on 
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Quantitative Biology 51, 751-758; Schwabe, J. W. et al. (1993) Cell 75, 567-578; Kumar, 
V. et al. (1987) Cell 51, 941-951; Mader, S. et al. (1993) Nucleic Acids Res 21, 1 125-1 132). 
Second, the D domain, providing flexibility between the amino- and carboxyl-termini of the 
receptor, also contains patches of amino-acids that act cooperatively for the nuclear 
5 localization of ER (Picard, D. et al. (1990) Cell Regulation 1, 291-299). 
7. Steroid hormone receptors 

96. Different sets of genes can be regulated by the same transcription modulator. 
Furthermore, there can be families of transcription modulators which are related in the way 
that they function or in the types of DNA binding sites they interact with, for example. One 

10 family of transcription modulators is the steroid hormone receptors. A steroid hormone 

receptor generally is a transcriptional modulator that binds DNA in a homo or hetero dimer 
fashion, and which also binds a small ligand, such as a steroid hormone. Typically, the 
function of the steroid hormone receptor is to activate transcription, and this function is 
controlled by the binding of the ligand, the steroid hormone, to the steroid hormone 

15 receptor. The binding of the ligand can facilitate dimerization, i.e. in the absence of the 
ligand, dimerization does not take place, or the ligand can facilitate the effect on 
transcription by, for example, allowing the appropriate conformation of the activation 
domain, so that the activation domain can exert its effect on the basal transcription 
machinery. There are many different members of the steroid hormone receptor of 

20 transcription modulators. For example, some members of the steroid hormone receptor 
family are the Estrogen receptor (ER), Glucocorticoid receptor (GR), Androgen receptor 
(AR), Progesterone receptor (PR), Thyroid receptor (TR), Retinoid X receptor (RXR), all- 
trans Retinoic Acid receptor (RAR), and Vitamin D receptor (VDR). One steroid hormone 
receptor that is used as an example herein is the estrogen receptor. 

25 a) The Estrogen Receptor 

97. The estrogen receptor (ER) can represent an example of other steroid receptors, 
which might have different specifics but will share the general features discussed herein. 
The homeodynamic regulation of target tissue function by estrogen hormones, particularly 
estradiol 17(3 (E2), is mediated through a complex array of convergent and divergent 

30 signaling pathways. These pathways orchestrate genomic and non-genomic events that 

result in target tissue-specific responses (Parker, M. G. (1998) Biochem Soc Symp 63, 45-50; 
McKenna, N. J. et al. (1999) EndocrRev 20, 321-344; Hall, J. M. et al. (2001) J Biol Chem 
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276, 36869-36872). The flow of E2 information necessary for genomic effects is encoded in 
the regulatory DNA elements of each responsive gene and is conveyed by estrogen receptor 
(ER) a or p as homodimers, or as the heterodimer when co-expressed (Parker, M. G. (1998) 
Biochem Soc Symp 63, 45-50; McKenna, N. J. et al. (1999) Endocr Rev 20, 321-344; Hall, 
5 J. M. et al. (2001) J Biol Chem 276, 36869-36872). As with other transcription factors, ERs 
are modular such that isolated structural domains display subsets of the functional activities 
of the intact receptor (Parker, M. G. (1998) Biochem Soc Symp 63, 45-50; McKenna, N. J. et 
al. (1999) Endocr Rev 20, 321-344; Hall, J. M. et al. (2001) J Biol Chem 276, 36869- 
36872). The amino-terminus A/B domain contains a ligand-independent transactivation 
10 function (AF- 1). The central region is the DNA binding domain (DBD), or the C domain. 
The flexible hinge domain, the D domain, contains a nuclear localization signal and links 
the C domain to the multi-functional carboxyl-terminus, the E/F domain. E/F mediates 
ligand binding, dimerization, and the ligand-dependent transactivation function (AF-2) of 
the receptor. 

15 98. The elements of the genomic action of ER involve a multi-step regulation in 

which ER is converted from an inactive form to a transcriptionally active state (Parker, M. 
G. (1998) Biochem Soc Symp 63, 45-50; McKenna, N. J. et al. (1999) Endocr Rev 20, 321- 
344; Hall, J. M. et al. (2001) J Biol Chem 276, 36869-36872). This regulation is initiated by 
a conformational change in ER upon E2 binding, dissociation from chaperone proteins and 

20 dimerization. The dimer ER interacts with an element which is an inverted repeat that is 
separated by three non-specific nucleotides. The consensus estrogen responsive element 
(ERE) is 5 '-GGTCAnnnTGACC-3' and sequences with one or more nucleotide variations of 
the consensus can also act as EREs. The binding of ER to an ERE is the initial step in the 
regulation of ERE-containing gene transcription. Other proteins can also be involved, for 

25 example, the ERE bound E2-ERa complex (ERa is a type of ER) interacts with co- 
regulatory proteins. These additional interactions promote chromatin remodeling and 
bridging with general transcription factors that result in the formation of a transcriptome. 
This is a ligand- and ERE-dependent ER signaling pathway. 

99. The E2-ER complex also modulates gene expression by tethering to a trans- 

30 acting factor bound to a czs-element, e.g. the AP-1 element-bound fos/jun complexes. These 
pathways can be called ligand-dependent ERE-independent pathways (Hall, J. M. et al. 
(2001) J Biol Chem 276, 36869-36872; (Kushner, P. J. et al. (2000) J Steroid Biochem Mol 
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Biol 74, 311-317; Safe, S. (2001) Vitam Horm 62, 231-252). In these contexts, the E2-ER 
complex is not binding the DNA at its cognate element. In addition, E2 can elicit non- 
genomic effects through membrane ER or ER isoforms (Razandi, M. et al. (2000) Mol 
Endocrinol 14, 1434-1447; Gu, Q. et al. (1999) Endocrinology 140, 660-666; Benten, W. P. 
5 et al. (2001) Endocrinology 142, 1669-1677) and through the cytoplasmic ER (Chen, Z. et 
al. (1999) J Clin Invest 103, 401-406; Simoncini, T. et al. (2000) Nature 407, 538-541). 
These non-genomic pathways are linked to intracellular signal transduction proteins. 

100. Although ER plays a central role in the modulation of ERE-containing genes, 
the direction and magnitude of gene expression are dictated by the integrated actions of 

10 many transacting factors (promoter cross-talk), co-activators/integrators, components of the 
basal transcription machinery and the chromatin within which the gene resides (Parker, M. 
G. (1998) Biochem Soc Symp 63, 45-50; McKenna, N. J. et al. (1999) Endocr Rev 20, 321- 
344; Hall, J. M. et al. (2001) J Biol Chem 276, 36869-36872). The type and availability of 
protein components of a transcription complex specific to each gene characterize promoter- 

15 and cell-specific responses (Parker, M. G. (1998) Biochem Soc Symp 63, 45-50; McKenna, 
N. J. et al. (1999) Endocr Rev 20, 321-344; Hall, J. M. et al. (2001) J Biol Chem 276, 
36869-36872). These proteins, responding to diverse signaling pathways, act cooperatively 
or antagonistically with ER to modulate the gene expression (McKenna, N. J. et al. (1999) 
Endocr Rev 20, 321-344; Hall, J. M. et al. (2001) J Biol Chem 276, 36869-36872; Safe, S. 

20 (2001) Vitam Horm 62, 231-252). In addition to the gene promoter, ER and ER associated 
co-regulatory proteins are targets for various signaling pathways. Post-translational 
modifications of un-liganded or liganded ER and/or co-regulatory proteins by various 
signaling pathways alters the ability of the receptor to modulate transactivation of 
responsive genes (Ali, S. et al. (1993) Embo J 12, 1 153-1 160; Aronica, S. M. et al. (1993) 

25 Mol Endocrinol 7, 743-752; Kato, S. et al. (1995) Science 270, 1491-1494; Nilsson, S. et al. 
(2001) Physiol Rev 81, 1535-1565; Tremblay, A. et al. (1999) Mol Cell 3, 513-519). 

101. Along with the development and function of the mammary gland as one of 
the estrogen target tissues, E2 is also involved in the initiation and progression of breast 
cancer that results from uncontrolled growth and division of mammary ductal epithelial cells 

30 (Jensen, E. V. (1996) Annals of the New York Academy of Sciences 784, 1-17; Jordan, V. C. 
et al. (1999) Endocr Rev 20, 253-278; Russo, J. et al. et al. (2000) J Natl Cancer Inst 
Monogr, 17-37). The understanding of mechanisms of ER actions, dissection of ER 
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pathways and associated factors in the physiology and pathophysiology of ER signaling are 
critical prognostic and therapeutic goals in cancer research. The current paradigms to 
achieve these goals are largely based on compounds with diverse pharmacology that target 
ER (Jensen, E. V. (1996) Annals of the New York Academy of Sciences 784, 1-17; Jordan, 
5 V. C. et al. (1999) Endocr Rev 20, 253-278; McDonnell, D. P. et al. (1999) Trends 
Endocrinol Metab 10, 301-31 1). The binding of these ER-ligands to ER induce 
conformational changes that alter the ability of receptor to interact with co-regulatory 
proteins whose compositions dictate the receptor activity. However, since ERs are also a 
target for multiple signals emanating from intracellular signaling pathways (Hall, J. M. et al. 

10 (2001) J Biol Chem 276, 36869-36872; (Clarke, R. et al. (2001) J Steroid Biochem Mol 
Biol 76, 71-84), post-translational processing of ERs could preclude the need of ERs for 
ligand or could differentially alter the function of the liganded ERs. Furthermore, the 
circumvention of ligand-ER mediated DNA-dependent events by aberrant signaling 
pathways that promoter cross-talk with the ligand-ER complex (Hall, J. M. et al. (2001) J 

15 Biol Chem 276, 36869-36872; (Clarke, R. et al. (2001) J Steroid Biochem Mol Biol 76, 71- 
84) would hinder the regulatory effects of ligands. The presence of mutant ERs (Fuqua, S. 
A. et al. (1993) J Cell Biochem 51, 135-139) that do not rely on ligands for function (i.e., 
dominant-negative or -positive ER variants) or the absence of ER expression (ER 
negativity) would also render the ligand-based approaches ineffective. 

20 8. Elements and DNA binding sites 

102. The DNA binding sites are as discussed herein. DNA binding sites are 
regions of DNA which are recognized by DNA binding domains. The DNA binding sites 
can be any site that is recognized by one of the disclosed DNA binding domains. For 
example, the DNA binding sites can be inverted repeats and direct repeats. When a region 

25 of DNA is recognized by a modulator or a dimer of modulators it can be referred to as an 
. element, such as an enhance element. 

103. When an element comprises two DNA binding sites, each site can be referred 
to as a half site, i.e. each site binds one dimer and there are two monomers in a dimer, each 
of which binds one half site. Furthermore, elements have two things in common, they all 

30 involve an orientation of the DNA binding sites relative to one another, and they all involve 
a spacing of the binding sites relative to one another. The orientation of binding sites can be 
referred to a direct or inverted, and this refers to the orientation of the DNA binding sites 
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relative to one another and relative to how they typically are found in the DNA. For 
example, generically a half site can be referred to a 5'-A-B-C-3' where A, B, and C are 
specific nucleotides. An element that has a direct orientation would be one where each 
DNA binding site has the same orientation 5' to 3'. For example, half site one is 5'-A-B-C- 

5 3' and half site two is 5'-A-B-C-3\ An element that has an inverted orientation is one 
where the second DNA binding site is reversed relative to the first DNA binding site. For 
example, half site one is 5'-A-B-C-3' and half site two is 5'-C-B-A-3\ In the preceding 
examples, each DNA binding site of the element is based on the same DNA binding site, 
indicating that the dimer which would bind this element would typically be a homodimer. 

1 o As indicated dimers can be heterodimers, which are formed by different monomers, which 
would typically have different DNA binding sites. The orientation of the hetero sites can 
also be direct or inverted, but would be understood relative to the typical orientation of each 
site in DNA. For example, the first DNA binding site can generically be represented as 5'- 
A-B-C-3' and the second DNA binding site can be generically referred to as 5'-D-E-F-3\ If 

15 these sites are typically found in this position in DNA then an element oriented as 5'-A-B- 
C-3' and 5'-D-E-F-3' would be a direct element and an element oriented as 5'A-B-C-3' and 
5'-F-E-D-3' would be an inverted element. It is clear that there are as many combinations 
of DNA binding sites forming elements as there are combinations of dimers and multimers 
(more than two monomers interacting together to bind DNA), and the orientation of these 

20 sites can be any permutation. Those of skill in the art understand how to identify DNA 
binding sites, elements, and the orientation of these sites. 

104. The other common aspect of elements is the spacing between the DNA 
binding sites. This spacing defines how far the DNA binding sites are from one another. 
For example, given the DNA binding site, 5'A-B-C-3' the element might be 5'-A-B-C-x-x- 

25 x-C-B-A-3'. This element has an inverted orientation and a spacing of three, because there 
are three units, or nucleotide for example, between the half sites. 

105. Examples of Estrogen response elements (ERE) are present in the human 
pS2 (Nunez, A. M. et al. (1989) EMBO J 8, 823-829), oxytocin (Richard, S. et al. (1990) J 
Biol Chem 265, 6098-6103), and lactoferrin (Teng, C. T. et al. (1992) Mol Endocrinol 6, 

30 1 969-198 1) genes. They have variant EREs having sequences 5'-GGTCAcggTGGCC-3' 
SEQ ID NO:6, 5'-GGTGAcctTGACC-3' (SEQ ID NO:7) and 5'-GGTCAaggCGATC-3' 
(SEQ ID NO:8), respectively; variations from the consensus are underlined. In certain 
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embodiments the disclosed compositions, such as linked modulators attached to an AD can 
enhance transcription from non-consensus sites. 

106. Glucocorticoid (GR), Progesterone (PR) and androgen (AR) receptors 
recognize hormone response elements (REs) having half-sites, 5-AGAACA-3', similar to 

5 that of the ERE (Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. 
(1995) Cell 83, 835-839). Thyroid (TR), retinoid X (RXR), ail-trans retinoic acid (RAR) 
and vitamin D (VD) receptors, on the other hand, interact with REs with half-sites, 5- 
AGGTCA-3', identical to that of ERE (Freedman, L. P. (1992) Endocr Rev 13, 129-145; 
Mangelsdorf, DJ. et al. (1995) Cell 83, 835-839). These sites however are arranged in 
10 direct (RXRE, RARE, VDRE, TRE) or inverted (G/P/A RE) repeats with various spacing 
(Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, D.J. et al. (1995) Cell 83, 
835-839). ERE half-sites without spacing also produce a palindromic TRE (Freedman, L. P. 
(1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. (1995) Cell 83, 835-839). 

107. It is understood that the DNA binding sites to which the compositions bind 
15 can be from hormone response elements. Hormone response elements are well-known in 

the art and understood to be specific DNA sequences located in regulatory regions of many 
genes that allow for ligand-dependent regulation of gene expression through trans-acting 
factors. Such regions generally comprise two half- sites. Examples ^of response elements in 
the art include elements containing two consensus hexameric half-sites. The identity of a 

20 response element resides in three features: the sequence of the base pairs in the half-site, the 
number of base pairs between the half-sites and the relative orientation of the two half-sites. 
Thus each receptor protein dimer that binds the DNA has to recognize the sequence, spacing 
and orientation of the half-sites within their response element. The disclosed compositions 
can in certain embodiments bind any response element. For example, in certain 

25 embodiments the bind a hormone response element selected from the group of response 
elements consisting of Estrogen Response Element (ERE), Glucocorticoid Response 
Element (GRE), Androgen Response Element (ARE), Progesterone Response Element 
(PRE), Thyroid Response Element (TRE), Retinoid X Response Element (RXRE), all-trans 
Retinoic Acid Response Elements (RARE), and Vitamin D Response Element (VDRE). As 

30 hormone response elements can comprise two half-sites, specifically disclosed are 

compositions that bind a DNA binding site, wherein the DNA binding site comprises a first 
half-site and a second half-site. 
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1 08. Many half-sites of hormone response elements are well-known in the art. 
For example, one such half-site is the sequence S'-AGAACA-S' (SEQ ID NO. 1). Another 
example includes 5'-AGGTCA-3' (SEQ ED NO: 2). Therefore, also disclosed are 
compositions that bind a DNA binding site wherein the DNA binding site comprises a first 
5 half-site and a second half-site, and wherein the first half-site comprises the sequence of 5'- 
AGAACA-3* (SEQ ID NO: 1). Also disclosed are compositions, wherein the first half-site 
comprises the sequence of 5'-AGGTCA-3' (SEQ ID NO: 2). Similarly disclosed are 
compositions that bind a DNA binding site wherein the DNA binding site comprises a first 
half-site and a second half-site, and wherein the second half-site comprises SEQ ID NO: 1. 

10 Also disclosed are compositions, wherein the second half-site comprises the sequence of 
SEQ ID NO: 2. It is understood and herein contemplated that half-sites can be repeats 
including direct and inverted repeats. Therefore in one embodiment the compositions 
interact with a first half-site and the second half-site in which the first and second half sites 
are direct repeats. In other embodiments the first and second half sites can in be inverted 

15 repeats. As the number of base-pairs between half-sites can be important for recognition by 
a DNA binding domain, specifically contemplated are compositions, wherein the first half- 
site and the second half site are not separated by a nucleotide. Also disclosed are 
compositions, wherein the first half-site and the second half site are separated by a 1, 2, 3, 4, 
5, 10, or 15 nucleotide spacer. 

20 109. Examples of DNA binding sites are well known in the art. Specifically 

disclosed is each and every DNA binding site. In particular, specifically disclosed are 
binding sites for hormone receptors. Thus, in one embodiment the compositions comprise a 
DNA binding domain bind that binds a DNA binding site from a hormone response 
element. It is understood that DNA binding sites can include but are not limited to 5'- 

25 AG AAC AnnnTGTTCT-3 9 (SEQ ID NO: 3), 5 '-AGGTCAnAGGTCA-3 ' (SEQ ID NO: 4), 
5 '-AGGTC ATGACCT-3 ' (SEQ ID NO: 5), 5'-GGTCAcggTGGCC-3' (SEQ ID NO: 6), 5'- 
GGTGAcctTGACC -3' (SEQ ID NO: 7), 5'- GGTCAaggCGATC -3' (SEQ ID NO: 8), or 
5'- GGTCAnnnTGACC -3' (SEQ ID NO: 9), GGGCAtcgTGACC (SEQ ID NO: 10) human 
angitensinogen gene ERE, GGTCGccaGGACC (SEQ ID NO: 1 1) human BCL-2 gene ERE, 

30 GGTCAggcTGGTC (SEQ ID NO: 12) human BRCA-1 gene ERE, GGCCGggcTGACC 
(SEQ ID NO: 13) human Calbinding-D9 gene ERE, GGCCGggcTGACC (SEQ ID 

NO: 14) human cathepsin D gene ERE, GGCCAcgaTGACA (SEQ ID NO: 1 5) human 
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choline acetyltransferase gene ERE, GGTGGcccTGACC (SEQ ID NO: 16) human 
complement-3 gene ERE, GGTCAaggTGACC (SEQ ID NO: 1 7) human cytochorome c 
oxidase subunit Vila-related protein (COX7RP) gene ERE, GGTCAtggTGACC (SEQ ID 
NO: 18) human estrogen responsive finger protein gene ERE, GCAGGagcTGACC (SEQ ID 
5 NO: 19) human Progesterone receptor gene ERE, GGTCAgcgTGGCC (SEQ ID NO: 20) 
human pS2 gene ERE, AATCAgacTGACT (SEQ ID NO: 21) human VEGF gene ERE, 
GGTCAggcTGGTC (SEQ ID NO: 22) human Alu gene ERE. 
9. Homology 

110. It is understood that one way to define any known variants and derivatives or 

10 those that might arise, of the disclosed genes and proteins herein is through defining the 
variants and derivatives in terms of homology to specific known sequences. For example 
SEQ ID NO: 46 sets forth a particular sequence of an ERa and SEQ ID NO: 45 sets forth a 
particular sequence of the protein encoded by SEQ ID NO: 46, an ERa protein. Specifically 
disclosed are variants of these and other genes and proteins herein disclosed which have at 

15 least, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99 percent homology to the stated sequence. Those of skill in the art 
readily understand how to determine the homology of two proteins or nucleic acids, such as 
genes. For example, the homology can be calculated after aligning the two sequences so 
that the homology is at its highest level. 

20 111. In general, it is understood that one way to define any known variants and 

derivatives or those that might arise, of the disclosed genes and proteins herein, is through 
defining the variants and derivatives in terms of homology to specific known sequences. 
This identity of particular sequences disclosed herein is also discussed elsewhere herein. In 
general, variants of genes and proteins herein disclosed typically have at least, about 70, 71, 

25 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 
96, 97, 98, or 99 percent homology to the stated sequence or the native sequence. Those of 
skill in the art readily understand how to determine the homology of two proteins or nucleic 
acids, such as genes. For example, the homology can be calculated after aligning the two 
sequences so that the homology is at its highest level. 

30 112. Another way of calculating homology can be performed by published 

algorithms. Optimal alignment of sequences for comparison may be conducted by the local 
homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the 
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homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by 
the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 
2444 (1988), by computerized implementations of these algorithms (GAP, BESTFTT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
5 Group, 575 Science Dr., Madison, WI), or by inspection. 

113. The same types of homology can be obtained for nucleic acids by for 
example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. 
Natl Acad. Set USA 86:7706-7710, 1989, Jaeger et al. Methods EnzymoL 183:281-306, 
1989 which are herein incorporated by reference for at least material related to nucleic acid 

10 alignment. It is understood that any of the methods typically can be used and that in certain 
instances the results of these various methods may differ, but the skilled artisan understands 
if identity is found with at least one of these methods, the sequences would be said to have 
the stated identity, and be disclosed herein. 

1 14. For example, as used herein, a sequence recited as having a particular percent 
15 homology to another sequence refers to sequences that have the recited homology as 

calculated by any one or more of the calculation methods described above. For example, a 
first sequence has 80 percent homology, as defined herein, to a second sequence if the first 
sequence is calculated to have 80 percent homology to the second sequence using the Zuker 
calculation method even if the first sequence does not have 80 percent homology to the 

20 second sequence as calculated by any of the other calculation methods. As another 

example, a first sequence has 80 percent homology, as defined herein, to a second sequence 
if the first sequence is calculated to have 80 percent homology to the second sequence using 
both the Zuker calculation method and the Pearson and Lipman calculation method even if 
the first sequence does not have 80 percent homology to the second sequence as calculated 

25 by the Smith and Waterman calculation method, the Needleman and Wunsch calculation 
method, the Jaeger calculation methods, or any of the other calculation methods. As yet 
another example, a first sequence has 80 percent homology, as defined herein, to a second 
sequence if the first sequence is calculated to have 80 percent homology to the second 
sequence using each of calculation methods (although, in practice, the different calculation 

30 methods will often result in different calculated homology percentages). 
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10. Hybridization/selective hybridization 

115. The term hybridization typically means a sequence driven interaction 
between at least two nucleic acid molecules, such as a primer or a probe and a gene. 
Sequence driven interaction means an interaction that occurs between two nucleotides or 

5 nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, 
G interacting with C or A interacting with T are sequence driven interactions. Typically 
sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the 
nucleotide. The hybridization of two nucleic acids is affected by a number of conditions 
and parameters known to those of skill in the art. For example, the salt concentrations, pH, 
10 and temperature of the reaction all affect whether two nucleic acid molecules will hybridize. 

116. Parameters for selective hybridization between two nucleic acid molecules 
are well known to those of skill in the art. For example, in some embodiments selective 
hybridization conditions can be defined as stringent hybridization conditions. For example, 

15 stringency of hybridization is controlled by both temperature and salt concentration of either 
or both of the hybridization and washing steps. For example, the conditions of 
hybridization to achieve selective hybridization may involve hybridization in high ionic 
strength solution (6X SSC or 6X SSPE) at a temperature that is about 12-25°C below the 
Tm (the melting temperature at which half of the molecules dissociate from their 

20 hybridization partners) followed by washing at a combination of temperature and salt 

concentration chosen so that the washing temperature is about 5°C to 20°C below the Tm. 
The temperature and salt conditions are readily determined empirically in preliminary 
experiments in which samples of reference DNA immobilized on filters are hybridized to a 
labeled nucleic acid of interest and then washed under conditions of different stringencies. 

25 Hybridization temperatures are typically higher for DNA-RNA and RNA-RNA 

hybridizations. The conditions can be used as described above to achieve stringency, or as 
is known in the art. (Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., 
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989; Kunkel et al. 
Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference for 

30 material at least related to hybridization of nucleic acids). A preferable stringent 

hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous 
solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization 
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and washing, if desired, can be reduced accordingly as the degree of complementarity 
desired is decreased, and further, depending upon the G-C or A-T richness of any area 
wherein variability is searched for. Likewise, stringency of hybridization and washing, if 
desired, can be increased accordingly as homology desired is increased, and further, 
5 depending upon the G-C or A-T richness of any area wherein high homology is desired, all 
as known in the art. 

1 1 7. Another way to define selective hybridization is by looking at the amount 
(percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in 
some embodiments selective hybridization conditions would be when at least about, 60, 65, 

10 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 

94, 95, 96, 97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non-limiting 
nucleic acid. Typically, the non-limiting primer is in for example, 10 or 100 or 1000 fold 
excess. This type of assay can be performed at under conditions where both the limiting and 
non-limiting primer are for example, 10 fold or 100 fold or 1000 fold below their kd, or 

1 5 where only one of the nucleic acid molecules is 1 0 fold or 1 00 fold or 1 000 fold or where 
one or both nucleic acid molecules are above their k<j. 

1 1 8. Another way to define selective hybridization is by looking at the percentage 
of primer that gets enzymatically manipulated under conditions where hybridization is 
required to promote the desired enzymatic manipulation. For example, in some 

20 embodiments selective hybridization conditions would be when at least about, 60, 65, 70, 
71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 

95, 96, 97, 98, 99, 100 percent of the primer is enzymatically manipulated under conditions 
which promote the enzymatic manipulation, for example if the enzymatic manipulation is 
DNA extension, then selective hybridization conditions would be when at least about 60, 

25 65, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 
93, 94, 95, 96, 97, 98, 99, 100 percent of the primer molecules are extended. Preferred 
conditions also include those suggested by the manufacturer or indicated in the art as being 
appropriate for the enzyme performing the manipulation. 

119. Just as with homology, it is understood that there are a variety of methods 
30 herein disclosed for determining the level of hybridization between two nucleic acid 

molecules. It is understood that these methods and conditions may provide different 
percentages of hybridization between two nucleic acid molecules, but unless otherwise 
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indicated meeting the parameters of any of the methods would be sufficient. For example if 
80% hybridization was required and as long as hybridization occurs within the required 
parameters in any one of these methods it is considered disclosed herein. 

120. It is understood that those of skill in the art understand that if a composition 

5 or method meets any one of these criteria for determining hybridization either collectively or 
singly it is a composition or method that is disclosed herein. 
11. Nucleic acids 

121 . There are a variety of molecules disclosed herein that are nucleic acid based, 
including for example the nucleic acids that encode, for example, ERa, ERp, AR, PR, GR, 

10 TR, RXRa, RARa, and VPR as well as any other proteins disclosed herein, as well as 

various functional nucleic acids. The disclosed nucleic acids are made up of for example, 
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these 
and other molecules are discussed herein. It is understood that for example, when a vector 
is expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U. 

15 Likewise, it is understood that if, for example, an antisense molecule is introduced into a 
cell or cell environment through for example exogenous delivery, it is advantagous that the 
antisense molecule be made up of nucleotide analogs that reduce the degradation of the 
antisense molecule in the cellular environment. 

a) Nucleotides and related molecules 

20 122. A nucleotide is a molecule that contains a base moiety, a sugar moiety and a 

phosphate moiety. Nucleotides can be linked together through their phosphate moieties and 
sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be 
adenin-9-yl (A), cytosin-l-yl (C), guanin-9-yl (G), uracil-l-yl (U), and thymin-l-yl (T). The 
sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a 

25 nucleotide is pentavalent phosphate. An non-limiting example of a nucleotide would be 3- 
AMP (3'-adenosine monophosphate) or 5'-GMP (5-guanosine monophosphate). 

123. A nucleotide analog is a nucleotide which contains some type of 
- modific ation to either the base, sugar, or phosphate moieties. Modifications to nucleotides 
.are well known in the art and would include for example, 5-methylcytosine (5-me-C), 

30 5-hydroxymethyl cytosine, xanthine, hypoxanthine, and 2-aminoadenine as well as 
modifications at the sugar or phosphate moieties. 
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124. Nucleotide substitutes are molecules having similar functional properties to 
nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid 
(PNA). Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson- 
Crick or Hoogsteen manner, but which are linked together through a moiety other than a 

5 phosphate moiety. Nucleotide substitutes are able to conform to a double helix type 
structure when interacting with the appropriate target nucleic acid. 

125. It is also possible to link other types of molecules (conjugates) to nucleotides 
or nucleotide analogs to enhance for example, cellular uptake. Conjugates can be 
chemically linked to the nucleotide or nucleotide analogs. Such conjugates include but are 

10 not limited to lipid moieties such as a cholesterol moiety. (Letsinger et al., Proc. Natl. Acad. 
Sci. USA, 1989,86, 6553-6556), 

126. A Watson-Crick interaction is at least one interaction with the Watson-Crick 
face of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of 
a nucleotide, nucleotide analog, or nucleotide substitute includes the C2, Nl, and C6 

1 5 positions of a purine based nucleotide, nucleotide analog, or nucleotide substitute and the 
C2, N3, C4 positions of a pyrimidine based nucleotide, nucleotide analog, or nucleotide 
substitute. 

127. A Hoogsteen interaction is the interaction that takes place on the Hoogsteen 
face of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex 

20 DNA. The Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the 
C6 position of purine nucleotides. 

12. Delivery of the compositions to cells 

128. There are a number of compositions and methods which can be used to 
deliver nucleic acids to cells, either in vitro or in vivo. These methods and compositions 

25 can largely be broken down into two classes: viral based delivery systems and non-viral 

based delivery systems. For example, the nucleic acids can be delivered through a number 
of direct delivery systems such as, electroporation, lipofection, calcium phosphate 
precipitation, plasmids, viral vectors, viral nucleic acids, phage nucleic acids, phages, 
cosmids, or via transfer of genetic material in cells or carriers such as cationic liposomes. 

30 Appropriate means for transfection, including viral vectors, chemical transfectants, or 
physico-mechanical methods such as electroporation and direct diffusion of DNA, are 
described by, for example, Wolff, J. A., et al., Science, 247, 1465-1468, (1990); and Wolff, 
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J. A. Nature, 352, 815-818, (1991)Such methods are well known in the art and readily 
adaptable for use with the compositions and methods described herein. In certain cases, the 
methods will be modifed to specifically function with large DNA molecules. Further, these 
methods can be used to target certain diseases and cell populations by using the targeting 

5 characteristics of the carrier. 

a) Nucleic acid based delivery systems 
129. Transfer vectors can be any nucleotide construction used to deliver genes 
into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of 
recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). 

10 1 30. As used herein, plasmid or viral vectors are agents that transport the 

disclosed nucleic acids, such as (VP16) 2 -CDC-(VP16) 2 into the cell without degradation and 
include a promoter yielding expression of the gene in the cells into which it is delivered. 
Viral vectors are , for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia 
virus, Polio vims, AIDS virus, neuronal trophic virus, Sindbis and other RNA viruses, 

15 including these viruses with the HTV backbone. Also preferred are any viral families which 
share the properties of these viruses which make them suitable for use as vectors. 
Retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that 
express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a 
larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this 

20 reason are a commonly used vector. However, they are not as useful in non-proliferating 
cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and 
can be delivered in aerosol formulation, and can transfect non-dividing cells. Pox viral 
vectors are large and have several sites for inserting genes, they are thermostable and can be 
stored at room temperature. A preferred embodiment is a viral vector which has been 

25 engineered so as to suppress the immune response of the host organism, elicited by the viral 
antigens. Preferred vectors of this type will carry coding regions for Interleukin 8 or 10. 

131. Viral vectors can have higher transaction (ability to introduce genes) abilities 
than chemical or physical methods to introduce genes into cells. Typically, viral vectors 
contain, nonstructural early genes, structural late genes, an RNA polymerase m transcript, 

30 inverted terminal repeats necessary for replication and encapsidation, and promoters to 

control the transcription and replication of the viral genome. When engineered as vectors, 
viruses typically have one or more of the early genes removed and a gene or gene/promotor 
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cassette is inserted into the viral genome in place of the removed viral DNA. Constructs of 
this type can carry up to about 8 kb of foreign genetic material. The necessary functions of 
the removed early genes are typically supplied by cell lines which have been engineered to 
express the gene products of the early genes in trans. 
5 (1) Retroviral Vectors 

132. A retrovirus is an animal virus belonging to the virus family of Retro viridae, 
including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are 
described by Verma, I.M., Retroviral vectors for gene transfer. In Microbiology- 1985, 
American Society for Microbiology, pp. 229-232, Washington, (1985), which is 

10 incorporated by reference herein. Examples of methods for using retroviral vectors for gene 
therapy are described in U.S. Patent Nos. 4,868,1 16 and 4,980,286; PCT applications WO 
90/02806 and WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of 
which are incorporated herein by reference. 

1 33. A retrovirus is essentially a package which has packed into it nucleic acid 
15 cargo. The nucleic acid cargo carries with it a packaging signal, which ensures that the 

replicated daughter molecules will be efficiently packaged within the package coat. In 
addition to the package signal, there are a number of molecules which are needed in cis, for 
the replication, and packaging of the replicated virus. Typically a retroviral genome, 
contains the gag, pol, and env genes which are involved in the making of the protein coat. It 

20 is the gag, pol, and env genes which are typically replaced by the foreign DNA that it is to 
be transferred to the target cell. Retrovirus vectors typically contain a packaging signal for 
incorporation into the package coat, a sequence which signals the start of the gag 
transcription unit, elements necessary for reverse transcription, including a primer binding 
site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide 

25 the switch of RNA strands during DNA synthesis, a purine rich sequence 5' to the 3' LTR 
that serve as the priming site for the synthesis of the second strand of DNA synthesis, and 
specific sequences near the ends of the LTRs that enable the insertion of the DNA state of 
the retrovirus to insert into the host genome. The removal of the gag, pol, and env genes 
allows for about 8 kb of foreign sequence to be inserted into the viral genome, become 

30 reverse transcribed, and upon replication be packaged into a new retroviral particle. This 

amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the 
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size of each transcript. It is preferable to include either positive or negative selectable 
markers along with other genes in the insert. 

1 34. Since the replication machinery and packaging proteins in most retroviral 
vectors have been removed (gag, pol, and env), the vectors are typically generated by 

5 placing them into a packaging cell line. A packaging cell line is a cell line which has been 
transfected or transformed with a retrovirus that contains the replication and packaging 
machinery, but lacks any packaging signal. When the vector carrying the DNA of choice is 
transfected into these cell lines, the vector containing the gene of interest is replicated and 
packaged into new retroviral particles, by the machinery provided in cis by the helper cell. 

10 The genomes for the machinery are not packaged because they lack the necessary signals. 

(2) Adenoviral Vectors 

135. The construction of replication-defective adenoviruses has been described 
(Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell. Biol. 6:2872- 
2883 (1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J. 

15 Virology 61: 1226-1239 (1987); Zhang "Generation and identification of recombinant 

adenovirus by liposome-mediated transfection and PCR analysis" BioTechniques 15:868- 
872 (1993)). The benefit of the use of these viruses as vectors is that they are limited in the 
extent to which they can spread to other cell types, since they can replicate within an initial 
infected cell, but are unable to form new infectious viral particles. Recombinant 

20 adenoviruses have been shown to achieve high efficiency gene transfer after direct, in vivo 
delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a 
number of other tissue sites (Morsy, J. Clin. Invest. 92:1580-1586 (1993); Kirshenbaum, 
J. Clin. Invest. 92:381-387 (1993); Roessler, J. Clin. Invest. 92:1085-1092 (1993); 
Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); 

25 Gomez-Foix, J. Biol. Chem. 267:25 1 29-25 1 34 (1 992); Rich, Human Gene Therapy 4:46 1 - 
476 (1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 
73:1201-1207 (1993); Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 
(1993); Caillaud, Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 
74:501-507 (1993)). Recombinant adenoviruses achieve gene transduction by binding to 

30 specific cell surface receptors, after which the virus is internalized by receptor-mediated 
endocytosis, in the same manner as wild type or replication-defective adenovirus 
(Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J. Virology 
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12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985); Seth, et al., J. 
Virol. 51:650-655 (1984); Seth/et al., Mol. Cell. Biol. 4:1528-1533 (1984); Varga et al., J. 
Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)). 

136. A viral vector can be one based on an adenovirus which has had the El gene 
5 removed and these virons are generated in a cell line such as the human 293 cell line. In 

another preferred embodiment both the El and E3 genes are removed from the adenovirus 
genome. 

(3) Adeno-asscociated viral vectors 

137. Another type of viral vector is based on an adeno-associated virus (AAV). 
10 This defective parvovirus is a preferred vector because it can infect many cell types and is 

nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type 
AAV is known to stably insert into chromosome 19. Vectors which contain this site 
specific integration property are preferred. An especially preferred embodiment of this type 
of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain 
15 the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the 
gene encoding the green fluorescent protein, GFP. 

138. In another type of AAV virus, the AAV contains a pair of inverted terminal 
repeats (ITRs) which flank at least one cassette containing a promoter which directs cell- 
specific expression operably linked to a heterologous gene. Heterologous in this context 

20 refers to any nucleotide sequence or gene which is not native to the AAV or B 1 9 parvovirus. 

139. Typically the AAV and B 1 9 coding regions have been deleted, resulting in a 
safe, noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and 
site-specific integration, but not cytotoxicity, and the promoter directs cell-specific 
expression. United states Patent No. 6,261,834 is herein incorproated by reference for 

25 material related to the AAV vector. 

140. The vectors of the present invention thus provide DNA molecules which are 
capable of integration into a mammalian chromosome without substantial toxicity. 

141. The inserted genes in viral and retroviral usually contain promoters, and/or 
enhancers to help control the expression of the desired gene product. A promoter is 

30 generally a sequence or sequences of DNA that function when in a relatively fixed location 
in regard to the transcription start site. A promoter contains core elements required for basic 
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interaction of RNA polymerase and transcription factors, and may contain upstream 
elements and response elements. 

(4) Large payload viral vectors 

142. Molecular genetic experiments with large human herpesviruses have 

5 provided a means whereby large heterologous DNA fragments can be cloned, propagated 
and established in cells permissive for infection with herpesviruses (Sun et al., Nature 
genetics 8: 33-41, 1994; Cotter and Robertson,. Curr Opin Mol Ther 5: 633-644, 1999). 
These large DNA viruses (herpes simplex virus (HSV) and Epstein-Barr virus (EBV), have 
the potential to deliver fragments of human heterologous DNA > 150 kb to specific cells. 

10 EBV recombinants can maintain large pieces of DNA in the infected B-cells as episomal 
DNA. Individual clones carried human genomic inserts up to 330 kb appeared genetically 
stable The maintenance of these episomes requires a specific EBV nuclear protein, EBNA1, 
constitutively expressed during infection with EBV. Additionally, these vectors can be used 
for transfection, where large amounts of protein can be generated transiently in vitro. 

15 Herpesvirus amplicon systems are also being used to package pieces of DNA > 220 kb and 
to infect cells that can stably maintain DNA as episomes. 

143. Other useful systems include, for example, replicating and host-restricted 
non-replicating vaccinia virus vectors. 

b) Non-nucleic acid based systems 

20 144. The disclosed compositions can be delivered to the target cells in a variety of 

ways. For example, the compositions can be delivered through electroporation, or through 
lipofection, or through calcium phosphate precipitation. The delivery mechanism chosen 
will depend in part on the type of cell targeted and whether the delivery is occurring for 
example in vivo or in vitro. 

25 145. Thus, the compositions can comprise, in addition to the disclosed vectors, 

lipids such as liposomes, such as cationic liposomes (e.g., DOTMA, DOPE, 
DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins to facilitate 
targeting- a particular cell, if desired. Administration of a composition comprising a 
compound and a cationic liposome can be administered to the blood afferent to a target 

30 organ or inhaled into the respiratory tract to target cells of the respiratory tract. Regarding 
liposomes, see, e.g., Brigham et zl.Am. J. Resp. Cell, Mol Biol 1:95-100 (1989); Feigner et 
al. Proc. Natl Acad. Sci USA 84:7413-7417 (1987); U.S. Pat. No.4,897,355. Furthermore, 



— 56 — 



WO 2005/033291 PCT/US2004/032561 

the compound can be administered as a component of a microcapsule that can be targeted to 
specific cell types, such as macrophages, or where the diffusion of the compound or delivery 
of the compound from the microcapsule is designed for a specific rate or dosage. 

1 46. In the methods described above which include the administration and uptake 

5 of exogenous DNA into the cells of a subj ect (i .e. , gene transduction or transfection), 

delivery of the compositions to cells can be via a variety of mechanisms. As one example, 
delivery can be via a liposome, using commercially available liposome preparations such as 
LIPOFECTIN, LIPOFECT AMINE (GD3CO-BRL, Inc., Gaithersburg, MD), SUPERFECT 
(Qiagen, Inc. Hilden, Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, 

10 WI), as well as other liposomes developed according to procedures standard in the art. In 
addition, the nucleic acid or vector of this invention can be delivered in vivo by 
electroporation, the technology for which is available from Genetronics, Inc. (San Diego, 
CA) as well as by means of a SONOPORATION machine (ImaRx Pharmaceutical Corp., 
Tucson, AZ). 

15 147. The materials may be in solution, suspension (for example, incorporated into 

microparticles, liposomes, or cells). These may be targeted to a particular cell type via 
antibodies, receptors, or receptor ligands. The following references are examples of the use 
of this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate 
Chem., 2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, 

20 et al., Br. J. Cancer. 58:700-703, (1988); Senter, et al., Bioconiugate Chem.. 4:3-9, (1993); 
Battelli, et al., Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, 
Tmmunolog. Reviews. 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol, 
42:2062-2065, (1991)). These techniques can be used for a variety of other speciifc cell 
types. Vehicles such as "stealth" and other antibody conjugated liposomes (including lipid 

25 mediated drug targeting to colonic carcinoma), receptor mediated targeting of DNA through 
cell specific ligands, lymphocyte directed tumor targeting, and highly specific therapeutic 
retroviral targeting of murine glioma cells in vivo. The following references are examples of 
the use of this technology to target specific proteins to tumor tissue (Hughes et al., Cancer 
Research, 49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophvsica Acta, 

30 1 1 04: 1 79-1 87, (1 992)). In general, receptors are involved in pathways of endocytosis, either 
constitutive or ligand induced. These receptors cluster in clathrin-coated pits, enter the cell 
via clathrin-coated vesicles, pass through an acidified endosome in which the receptors are 
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sorted, and then either recycle to the cell surface, become stored intracellular^, or are 
degraded in lysosomes. The internalization pathways serve a variety of functions, such as 
nutrient uptake, removal of activated proteins, clearance of macromolecules, opportunistic 
entry of viruses and toxins, dissociation and degradation of ligand, and receptor-level 

5 regulation. Many receptors follow more than one intracellular pathway, depending on the 
cell type, receptor concentration, type of ligand, ligand valency, and ligand concentration. 
Molecular and cellular mechanisms of receptor-mediated endocytosis has been reviewed 
(Brown and Greene, DNA and Cell Biology 10:6, 399-409 (1991)). 

148. Nucleic acids that are delivered to cells which are to be integrated into the 

10 host cell genome, typically contain integration sequences. These sequences are often viral 
related sequences, particularly when viral based systems are used. These viral intergration 
systems can also be incorporated into nucleic acids which are to be delivered using a non- 
nucleic acid based system of deliver, such as a liposome, so that the nucleic acid contained 
in the delivery system can be come integrated into the host genome. 

15 149. Other general techniques for integration into the host genome include, for 

example, systems designed to promote homologous recombination with the host genome. 
These systems typically rely on sequence flanking the nucleic acid to be expressed that has 
enough homology with a target sequence within the host cell genome that recombination 
between the vector nucleic acid and the target nucleic acid takes place, causing the delivered 

20 nucleic acid to be integrated into the host genome. These systems and the methods 
necessary to promote homologous recombination are known to those of skill in the art. 

c) In vivo/ex vivo 

150. As described above, the compositions can be administered in a 
pharmaceutically acceptable carrier and can be delivered to the subject's cells in vivo and/or 

25 ex vivo by a variety of mechanisms well known in the art (e.g., uptake of naked DNA, 

liposome fusion, intramuscular injection of DNA via a gene gun, endocytosis and the like). 

151. If ex vivo methods are employed, cells or tissues can be removed and 
maintained outside the body according to standard protocols well known in the art. The 
compositions can be introduced into the cells via any gene transfer mechanism, such as, for 

30 example, calcium phosphate mediated gene delivery, electroporation, microinjection or 
proleolipo somes. The transduced cells can then be infused (e.g., in a pharmaceutically 
acceptable carrier) or homotopically transplanted back into the subject per standard methods 
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for the cell or tissue type. Standard methods are known for transplantation or infusion of 
various cells into a subject. 

13. Expression systems 

152. The nucleic acids that are delivered to cells typically contain expression 

5 controlling systems. For example, the inserted genes in viral and retroviral systems usually 
contain promoters, and/or enhancers to help control the expression of the desired gene 
product. A promoter is generally a sequence or sequences of DNA that function when in a 
relatively fixed location in regard to the transcription start site. A promoter contains core 
elements required for basic interaction of RNA polymerase and transcription factors, and 

10 may contain upstream elements and response elements. 

a) Viral Promoters and Enhancers 

153. Preferred promoters controlling transcription from vectors in mammalian 
host cells may be obtained from various sources, for example, the genomes of viruses such 
as: polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most 

15 preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin 
promoter. The early and late promoters of the SV40 virus are conveniently obtained as an 
SV40 restriction fragment which also contains the SV40 viral origin of replication (Fiers et 
al., Nature. 273: 113 (1978)). The immediate early promoter of the human cytomegalovirus 
is conveniently obtained as a Hindm E restriction fragment (Greenway, PJ. et al., Gene 18: 

20 355-360 (1982)). Of course, promoters from the host cell or related species also are useful 
herein. 

1 54. Enhancer generally refers to a sequence of DNA that functions at no fixed 
distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl. 
Acad. Set. 78: 993 (1981)) or 3 1 (Lusky, M.L., et al., Mol. Cell Bio. 3: 1 108 (1983)) to the 

25 transcription unit. Furthermore, enhancers can be within an intron (Banerji, J .L. et al., Cell 
33: 729 (1983)) as well as within the coding sequence itself (Osborne, T.F., et al., Mol. Cell 
Bio. 4: 1293 (1984)). They are usually between 10 and 300 bp in length, and they function 
in cis. Enhancers function to increase transcription from nearby promoters. Enhancers 
also often contain response elements that mediate the regulation of transcription. Promoters 

30 can also contain response elements that mediate the regulation of transcription. Enhancers 
often determine the regulation of expression of a gene. While many enhancer sequences are 
now known from mammalian genes (globin, elastase, albumin, -fetoprotein and insulin), 
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typically one will use an enhancer from a eukaryotic cell virus for general expression. 
Preferred examples are the SV40 enhancer on the late side of the replication origin (bp 
100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 
5 1 5 5 . The promotor and/or enhancer may be specifically activated either by light or 

specific chemical events which trigger their function. Systems can be regulated by reagents 
such as tetracycline and dexamethasone. There are also ways to enhance viral vector gene 
expression by exposure to irradiation, such as gamma irradiation, or alkylating 
chemotherapy drugs. 

10 156. In certain embodiments the promoter and/or enhancer region can act as a 

constitutive promoter and/or enhancer to maximize expression of the region of the 
transcription unit to be transcribed. In certain constructs the promoter and/or enhancer 
region be active in all eukaryotic cell types, even if it is only expressed in a particular type 
of cell at a particular time. A preferred promoter of this type is the CMV promoter (650 

15 bases). Other preferred promoters are SV40 promoters, cytomegalovirus (full length 
promoter), and retroviral vector LTF. 

157. It has been shown that all specific regulatory elements can be cloned and 
used to construct expression vectors that are selectively expressed in specific cell types such 
as melanoma cells. The glial fibrillary acetic protein (GFAP) promoter has been used to 

20 selectively express genes in cells of glial origin. 

158. Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
animal, human or nucleated cells) may also contain sequences necessary for the termination 
of transcription which may affect mRNA expression. These regions are transcribed as 
polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor 

25 protein. The 3' untranslated regions also include transcription termination sites. It is 

preferred that the transcription unit also contain a polyadenylation region. One benefit of 
this region is that it increases the likelihood that the transcribed unit will be processed and 
transported like mRNA. The identification and use of polyadenylation signals in 
expression constructs is well established. It is preferred that homologous polyadenylation 

30 signals be used in the transgene constructs. In certain transcription units, the 

polyadenylation region is derived from the SV40 early polyadenylation signal and consists 
of about 400 bases. It is also preferred that the transcribed units contain other standard 
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sequences alone or in combination with the above sequences improve expression from, or 
stability of, the construct. 

b) Markers 

1 59. The viral vectors can include nucleic acid sequence encoding a marker 

5 product. This marker product is used to determine if the gene has been delivered to the cell 
and once delivered is being expressed. Preferred marker genes are the E. Coli lacZ gene, 
which encodes B-galactosidase, and green fluorescent protein. 

1 60. In some embodiments the marker may be a selectable marker. Examples of 
suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), 

10 thymidine kinase, neomycin, neomycin analog G41 8, hydromycin, and puromycin. When 
such selectable markers are successfully transferred into a mammalian host cell, the 
transformed mammalian host cell can survive if placed under selective pressure. There are 
two widely used distinct categories of selective regimes. The first category is based on a 
cell's metabolism and the use of a mutant cell line which lacks the ability to grow 

15 independent of a supplemented media. Two examples are: CHO DHFR- cells and mouse 
LTK- cells. These cells lack the ability to grow without the addition of such nutrients as 
thymidine or hypoxanthine. Because these cells lack certain genes necessary for a complete 
nucleotide synthesis pathway, they cannot survive unless the missing nucleotides are 
provided in a supplemented media. An alternative to supplementing the media is to 

20 introduce an intact DHFR or TK gene into cells lacking the respective genes, thus altering 
their growth requirements. Individual cells which were not transformed with the DHFR or 
TK gene will not be capable of survival in non-supplemented media. 

161. The second category is dominant selection which refers to a selection scheme 
used in any cell type and does not require the use of a mutant cell line. These schemes 

25 typically use a drug to arrest growth of a host cell. Those cells which have a gene would 
express a protein conveying drug resistance and would survive the selection. Examples of 
such dominant selection use the drugs neomycin, (Southern P. and Berg, P., J. Molec. Appl. 
Genet. 1 : 327 (1982)), mycophenolic acid, (Mulligan, R.C. and Berg, P. Science 209: 1422 
(1980)) or hygromycin, (Sugden, B. et al., Mol. Cell. Biol. 5: 410-413 (1985)). The three 

30 examples employ bacterial genes under eukaryotic control to convey resistance to the 

appropriate drug G418 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, 
respectively. Others include the neomycin analog G418 and puramycin. 
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14. Peptides 

a) Protein variants 

162. As discussed herein there are numerous variants of the activator protein and 
repressor protein that are known and herein contemplated. In addition, to the known 
5 functional strain variants there are derivatives of the elements that comprise the disclosed 
compositions which also function in the disclosed methods and compositions. Protein 
variants and derivatives are well understood to those of skill in the art and in can involve 
amino acid sequence modifications. For example, amino acid sequence modifications 
typically fall into one or more of three classes: substitutional, insertional or deletional 
10 variants. Insertions include amino and/or carboxyl terminal fusions as well as intrasequence 
insertions of single or multiple amino acid residues. Insertions ordinarily will be smaller 
insertions than those of amino or carboxyl terminal fusions, for example, on the order of one 
to four residues. Immunogenic fusion protein derivatives, such as those described in the 
examples, are made by fusing a polypeptide sufficiently large to confer immunogenicity to 

15 the target sequence by cross-linking in vitro or by recombinant cell culture transformed with 
DNA encoding the fusion. Deletions are characterized by the removal of one or more amino 
acid residues from the protein sequence. Typically, no more than about from 2 to 6 residues 
are deleted at any one site within the protein molecule. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the protein, 

20 thereby producing DNA encoding the variant, and thereafter expressing the DNA in 

recombinant cell culture. Techniques for making substitution mutations at predetermined 
sites in DNA having a known sequence are well known, for example Ml 3 primer 
mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single 
residues, but can occur at a number of different locations at once; insertions usually will be 

25 on the order of about from 1 to 10 amino acid residues; and deletions will range about from 
1 to 30 residues. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion 
of 2 residues or insertion of 2 residues. Substitutions, deletions, insertions or any 
combination thereof may be combined to arrive at a final construct. The mutations must not 
place the sequence out of reading frame and preferably will not create complementary 

30 regions that could produce secondary mRNA structure. Substitutional variants are those in 
which at least one residue has been removed and a different residue inserted in its place. 
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Such substitutions generally are made in accordance with the following Tables 1 and 2 and 
are referred to as conservative substitutions. 

1 63. TABLE 3: Amino Acid Abbreviations 



Amino Acid 


Abbreviations 


Alanine 


Ala A 


Allosoleucine 


AHe 


Axginine 


ArgR 


asparagines 


Asn N 


aspartic acid 


AspD 


Cysteine 


CysC 


glutamic acid 


GluE 


Glutamine 


Gin Q 


Glycine 


GlyG 


Histidine 


His H 


Isolelucine 


He I, 


Leucine 


Leu L 


Lysine 


LysK 


phenylalanine 


PheF 


Proline 


ProP 


pyroglutamic 
acidp 


pGlu 


Serine 


Ser S 


Threonine 


ThrT [ 


Tyrosine 


TyrY 


Tryptophan 


Trp W 


Valine 


Val V 



TABLE 4:Amino Acid Substitutions 

Original Residue Exemplary Conservative Substitutions, others are known in the art. 

Ala; Ser 

Arg;Lys; Gin __ 

Asn; Gin; His 

Asp; Glu 

Cys; Ser 

Gin; Asn, Lys 

Glu; Asp 

Gly; Pro 

His; Asn; Gin 

He; Leu; Val 

Leu; lie; Val 

Lys; Arg; Gin; 

Met; Leu; He 

Phe; Met; Leu; Tyr 

Ser; Thr 

Thr; Ser 
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Trp; Tyr 

Tyr; Trp; Phe 

Val; lie; Leu 

164. Substantial changes in function or immunological identity are made by 

selecting substitutions that are less conservative than those in Table 4, i.e., selecting residues 

that differ more significantly in their effect on maintaining (a) the structure of the 

polypeptide backbone in the area of the substitution, for example as a sheet or helical 

5 conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c) the 

bulk of the side chain. The substitutions which in general are expected to produce the 

greatest changes in the protein properties will be those in which (a) a hydrophilic residue, 

e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, 

phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other 

10 residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is 

substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue 

having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a 

side chain, e.g., glycine, in this case, (e) by increasing the number of sites for sulfation 

and/or glycosylation. 

15 1 65. For example, the replacement of one amino acid residue with another that is 

biologically and/or chemically similar is known to those skilled in the art as a conservative 
substitution. For example, a conservative substitution would be replacing one hydrophobic 
residue for another, or one polar residue for another. The substitutions include 
combinations such as, for example, Gly, Ala; Val, He, Leu; Asp, Glu; Asn, Gin; Ser, Thr; 

20 Lys, Arg; and Phe, Tyr. Such conservatively substituted variations of each explicitly 
disclosed sequence are included within the mosaic polypeptides provided herein. 

166. Substitutional or deletional mutagenesis can be employed to insert sites for 
N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or 
other labile residues also may be desirable. Deletions or substitutions of potential 

25 proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the basic residues 
or substituting one by glutaminyl or histidyl residues. 

167. Certain post-translational derivatizations are the result of the action of 
recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues 
are frequently post-translationally deamidated to the corresponding glutamyl and asparyl 

30 residues. Alternatively, these residues are deamidated under mildly acidic conditions. 
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Other post-translational modifications include hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the o- 
amino groups of lysine, arginine, and histidine side chains (T.E. Creighton, Proteins: 
Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), 
5 acetylation of the N-terminal amine and, in some instances, amidation of the C-terminal 
carboxyl. 

168. It is understood that one way to define the variants and derivatives of the 
disclosed proteins herein is through defining the variants and derivatives in terms of 
homology/identity to specific known sequences. For example, SEQ ID NO:46 sets forth a 

10 particular sequence of ERa and SEQ ID NO:45 sets forth a particular sequence of a ERa 
protein. Specifically disclosed are variants of these and other proteins herein disclosed 
which have at least, 70% or 75% or 80% or 85% or 90% or 95% homology to the stated 
sequence. Those of skill in the art readily understand how to determine the homology of 
two proteins. For example, the homology can be calculated after aligning the two sequences 

15 so that the homology is at its highest level. 

1 69. Another way of calculating homology can be performed by published 
algorithms. Optimal alignment of sequences for comparison may be conducted by the local 
homology algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the 
homology alignment algorithm of Needleman and Wunsch, J. MoL Biol. 48: 443 (1970), by 

20 the search for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 
2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer 
Group, 575 Science Dr., Madison, WI), or by inspection. 

170. The same types of homology can be obtained for nucleic acids by for 

25 example the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. 
Natl Acad. Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol 183:281-306, 
1989 which are herein incorporated by reference for at least material related to nucleic acid 
alignment. 

171. It is understood that the description of conservative mutations and homology 
30 can be combined together in any combination, such as embodiments that have at least 70% 

homology to a particular sequence wherein the variants are conservative mutations. 
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172. As this specification discusses various proteins and protein sequences it is 
understood that the nucleic acids that can encode those protein sequences are also disclosed. 
This would include all degenerate sequences related to a specific protein sequence, i.e. all 
nucleic acids having a sequence that encodes one particular protein sequence as well as all 

5 nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and 

derivatives of the protein sequences. Thus, while each particular nucleic acid sequence may 
not be written out herein, it is understood that each and every sequence is in fact disclosed 
and described herein through the disclosed protein sequence. For example, one of the many 
nucleic acid sequences that can encode the protein sequence set forth in SEQ ID NO:45 is 

10 set forth in SEQ ID NO:46. In addition, for example, a disclosed conservative derivative of 
SEQ ID NO:24 is shown in SEQ ID NO: 81, where the isoleucine (I) at position 5 is 
changed to a valine (V). It is understood that for this mutation all of the nucleic acid 
sequences that encode this particular derivative of the ERot hinge domain are also disclosed. 

15 173. It is understood that there are numerous amino acid and peptide analogs 

which can be incorporated into the disclosed compositions. For example, there are 
numerous D amino acids or amino acids which have a different functional substituent then 
the amino acids shown in Table 3 and Table 4. The opposite stereo isomers of naturally 
occurring peptides are disclosed, as well as the stereo isomers of peptide analogs. These 

20 amino acids can readily be incorporated into polypeptide chains by charging tRNA 

molecules with the amino acid of choice and engineering genetic constructs that utilize, for 
example, amber codons, to insert the analog amino acid into a peptide chain in a site 
specific way (Thorson et al., Methods in Molec. Biol. 77:43-73 (1991), Zoller, Current 
Opinion in Biotechnology, 3:348-354 (1992); Ibba, Biotechnology & Genetic Enginerring 

25 Reviews 13:197-216 (1995), Cahill et al., TIBS, 14(10):400-403 (1989); Benner, TIB Tech, 
12:158-163 (1994); Ibba and Hennecke, Bio/technology, 12:678-682 (1994) all of which are 
herein incorporated by reference at least for material related to amino acid analogs). 

174. Molecules can be produced that resemble peptides, but which are not 
connected via a natural peptide linkage. For example, linkages for amino acids or amino 

30 acid analogs can include CH 2 NH~, -CH 2 S-, ~CH 2 ~CH 2 ~, -CH=CH- (cis and trans), 

COCH 2 -, --CH(OH)CH 2 ~, and -CHH 2 SO— (These and others can be found in Spatola, A. 
F. in Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, 
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eds., Marcel Dekker, New York, p. 267 (1983); Spatola, A. R, Vega Data (March 1983), 
Vol. 1, Issue 3, Peptide Backbone Modifications (general review); Morley, Trends Pharm 
Sci (1980) pp. 463-468; Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (»CH 2 NH- 
-, CH 2 CH 2 ~); Spatola et al. Life Sci 38:1243-1249 (1986) (-CH H 2 --S); Hann J. Chem. Soc 

5 Perkin Trans. 1 307-314 (1982) (-CH-CH-, cis and trans); Almquist et al. J. Med. Chem. 
23:1392-1398 (1980) («COCH 2 --); Jennings-White et al. Tetrahedron Lett 23:2533 (1982) 
(~COCH 2 ~); Szelke et al. European Appln, EP 45665 CA (1982): 97:39405 (1982) (-- 
CH(OH)CH 2 ~); Holladay et al. Tetrahedron. Lett 24:4401-4404 (1983) (~C(OH)CH 2 -); 
and Hruby Life Sci 31:189-199 (1982) (--CH 2 ~S~); each of which is incorporated herein by 

10 reference. A particularly preferred non-peptide linkage is — CH 2 NH— . It is understood that 
peptide analogs can have more than one atom between the bond atoms, such as b-alanine, g- 
aniinobutyric acid, and the like. 

1 75. Amino acid analogs and analogs and peptide analogs often have enhanced or 
desirable properties, such as, more economical production, greater chemical stability, 

15 enhanced pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered 
specificity (e.g., a broad-spectrum of biological activities), reduced antigenicity, and others. 

176. D-amino acids can be used to generate more stable peptides, because D 
amino acids are not recognized by peptidases and such. Systematic substitution of one or 
more amino acids of a consensus sequence with a D-amino acid of the same type (e.g., D- 

20 lysine in place of L-lysine) can be used to generate more stable peptides. Cysteine residues 
can be used to cyclize or attach two or more peptides together. This can be beneficial to 
constrain peptides into particular conformations. (Rizo and Gierasch Ann. Rev. Biochem. 
61:387 (1992), incorporated herein by reference). 

15. Pharmaceutical carriers/Delivery of pharamceutical products 

25 177. As described above, the compositions can also be administered in vivo in a 

pharmaceutical^ acceptable carrier. By "pharmaceutically acceptable" is meant a material 
that is not biologically or otherwise undesirable, i.e., the material may be administered to a 
subject, along with the nucleic acid or vector, without causing any undesirable biological 
effects or interacting in a deleterious manner with any of the other components of the 

30 pharmaceutical composition in which it is contained. The carrier would naturally be 

selected to minimize any degradation of the active ingredient and to minimize any adverse 
side effects in the subject, as would be well known to one of skill in the art. 
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178. The compositions may be administered orally, parenterally (e.g., 
intravenously), by intramuscular injection, by intraperitoneal injection, transdermal ly, 
extracorporeal^, topically or the like, although topical intranasal administration or 
administration by inhalant is typically preferred. As used herein, "topical intranasal 

5 administration" means delivery of the compositions into the nose and nasal passages 
through one or both of the nares and can comprise delivery by a spraying mechanism or 
droplet mechanism, or through aerosolization of the nucleic acid or vector. The latter may 
be effective when a large number of animals is to be treated simultaneously. Administration 
of the compositions by inhalant can be through the nose or mouth via delivery by a spraying 

1 0 or droplet mechanism. Delivery can also be directly to any area of the respiratory system 
(e.g., lungs) via intubation. The exact amount of the compositions required will vary from 
subject to subject, depending on the species, age, weight and general condition of the 
subject, the severity of the allergic disorder being treated, the particular nucleic acid or 
vector used, its mode of administration and the like. Thus, it is not possible to specify an 

1 5 exact amount for every composition. However, an appropriate amount can be determined 
by one of ordinary skill in the art using only routine experimentation given the teachings 
herein. 

179. Parenteral administration of the composition, if used, is generally 
characterized by injection. Injectables can be prepared in conventional forms, either as 

20 liquid solutions or suspensions, solid forms suitable for solution of suspension in liquid 
prior to injection, or as emulsions. A more recently revised approach for parenteral 
administration involves use of a slow release or sustained release system such that a 
constant dosage is maintained. See, e.g., U.S. Patent No. 3,610,795, which is incorporated 
by reference herein. 

25 180. The materials may be in solution, suspension (for example, incorporated into 

microparticles, liposomes, or cells). These may be targeted to a particular cell type via 
antibodies, receptors, or receptor ligands. The following references are examples of the use 
of-this-technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate 
Chem.. 2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer. 60:275-281, (1989); Bagshawe, 

30 et al., Br. J. Cancer, 58:700-703, (1988); Senter, et al., Bioconjusate Chem., 4:3-9, (1993); 
Battelli, et al., Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, 
Tmmiinolo p. Reviews . 129:57-80, (1992); andRoffler, et al., Biochem. Pharmacol, 42:2062- 
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2065, (1991)). Vehicles such as "stealth" and other antibody conjugated liposomes 
(including lipid mediated drug targeting to colonic carcinoma), receptor mediated targeting 
of DNA through cell specific ligands, lymphocyte directed tumor targeting, and highly 
specific therapeutic retroviral targeting of murine glioma cells in vivo. The following 
5 references are examples of the use of this technology to target specific proteins to tumor 
tissue (Hughes et al, Cancer Research, 49:6214-6220, (1989); and Litzinger and Huang, 
Biochimica et Biophvsica Acta, 1 104:179-187, (1992)). In general, receptors are involved 
in pathways of endocytosis, either constitutive or ligand induced. These receptors cluster in 
clathrin-coated pits, enter the cell via clathrin-coated vesicles, pass through an acidified 

10 endosome in which the receptors are sorted, and then either recycle to the cell surface, 

become stored intracellularly, or are degraded in lysosomes. The internalization pathways 
serve a variety of functions, such as nutrient uptake, removal of activated proteins, clearance 
of macromolecules, opportunistic entry of viruses and toxins, dissociation and degradation 
of ligand, and receptor-level regulation. Many receptors follow more than one intracellular 

15 pathway, depending on the cell type, receptor concentration, type of ligand, ligand valency, 
and ligand concentration. Molecular and cellular mechanisms of receptor-mediated 
endocytosis has been reviewed (Brown and Greene, DNA and Cell Biology 10:6, 399-409 
(1991)). 

a) Pharmaceutical^ Acceptable Carriers 

20 181. The compositions, including antibodies, can be used therapeutically in 

combination with a pharmaceutically acceptable carrier. 

182. Pharmaceutical carriers are known to those skilled in the art. These most 
typically would be standard carriers for administration of drugs to humans, including 
solutions such as sterile water, saline, and buffered solutions at physiological pH. The 

25 compositions can be administered intramuscularly or subcutaneously. Other compounds 
will be administered according to standard procedures used by those skilled in the art. 

1 83. Pharmaceutical compositions may include carriers, thickeners, diluents, 
buffers, preservatives, surface active agents and the like in addition to the molecule of choice. 
Pharmaceutical compositions may also include one or more active ingredients such as 

30 antimicrobial agents, antiinflammatory agents, anesthetics, and the like. 

1 84. The pharmaceutical composition may be administered in a number of ways 
depending on whether local or systemic treatment is desired, and on the area to be treated. 
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Administration may be topically (including ophthalmically, vaginally, rectally, intranasally), 
orally, by inhalation, or parenterally, for example by intravenous drip, subcutaneous, 
intraperitoneal or intramuscular injection. The disclosed antibodies can be administered 
intravenously, intraperitoneally, intramuscularly, subcutaneously, intracavity, or 
5 transdermally. 

185. Preparations for parenteral administration include sterile aqueous or non- 
aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are 
propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic 
esters, such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous solutions, 

10 emulsions or suspensions, including saline and buffered media. Parenteral vehicles include 
sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated 
Ringer's, or fixed oils. Intravenous vehicles include fluid and nutrient replenishers, 
electrolyte replenishers (such as those based on Ringer's dextrose), and the like. 
Preservatives and other additives may also be present such as, for example, antimicrobials, 

15 anti-oxidants, chelating agents, and inert gases and the like. 

1 86. Formulations for topical administration may include ointments, lotions, 
creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical 
carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. 

1 87. Compositions for oral administration include powders or granules, suspensions 
20 or solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, 

flavorings, diluents, emulsifiers, dispersing aids or binders maybe desirable.. 

188. Some of the compositions may potentially be administered as a 
pharmaceutically acceptable acid- or base- addition salt, formed by reaction with inorganic 
acids such as hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic 

25 acid, sulfuric acid, and phosphoric acid, and organic acids such as formic acid, acetic acid, 
propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic 
acid, maleic acid, and fiimaric acid, or by reaction with an inorganic base such as sodium 
hydroxide, ammonium hydroxide, potassium hydroxide, and organic bases such as mono-, 
di-, trialkyl and aryl amines and substituted ethanolamines. 

30 b) Therapeutic Uses 

189. The dosage ranges for the administration of the compositions are those large 
enough to produce the desired effect in which the symptoms disorder are effected. The 



— 70 — 



WO 2005/033291 PCT/US2004/032561 

dosage should not be so large as to cause adverse side effects, such as unwanted cross- 
reactions, anaphylactic reactions, and the like. Generally, the dosage will vary with the age, 
condition, sex and extent of the disease in the patient and can be determined by one of skill 
in the art. The dosage can be adjusted by the individual physician in the event of any 
5 counterindications. Dosage can vary, and can be administered in one or more dose 
administrations daily, for one or several days. 

16. Compositions identified by screening with disclosed compositions / 
combinatorial chemistry 

a) Combinatorial chemistry 
10 190. The disclosed compositions can be used as targets for any combinatorial 

technique to identify molecules or macromolecular molecules that interact with the 
disclosed compositions in a desired way. 

191 . It is understood that when using the disclosed compositions in combinatorial 
techniques or screening methods, molecules, such as macromolecular molecules, will be 

15 identified that have particular desired properties such as inhibition or stimulation or the 

target molecule's function. The molecules identified and isolated when using the disclosed 
compositions, such as, (VP16)2-CDC-(VP16) 2 , are also disclosed. Thus, the products 
produced using the combinatorial or screening approaches that involve the disclosed 
compositions, such as, (W16)2-CDC-(VP16) 2 , are also considered herein disclosed. 

20 192. Combinatorial chemistry includes but is not limited to all methods for 

isolating small molecules or macromolecules that are capable of binding either a small 
molecule or another macromolecule, typically in an iterative process. Proteins, 
oligonucleotides, and sugars are examples of macromolecules. For example, 
oligonucleotide molecules with a given function, catalytic or ligand-binding, can be isolated 

25 from a complex mixture of random oligonucleotides in what has been referred to as "in vitro 
genetics" (Szostak, TIBS 19:89, 1992). One synthesizes a large pool of molecules bearing 
random and defined sequences and subjects that complex mixture, for example, 
approximately 10 15 individual sequences in 100 \xg of a 100 nucleotide RNA, to some 
selection and enrichment process. Through repeated cycles of affinity chromatography and 

30 PCR amplification of the molecules bound to the ligand on the column, Ellington and 

Szostak (1990) estimated that 1 in 10 10 RNA molecules folded in such a way as to bind a 
small molecule dyes. DNA molecules with such ligand-binding behavior have been isolated 
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as well (Ellington and Szostak, 1992; Bock et al, 1992). Techniques aimed at similar goals 
exist for small organic molecules, proteins, antibodies and other macromolecules known to 
those of skill in the art. Screening sets of molecules for a desired activity whether based on 
small organic libraries, oligonucleotides, or antibodies is broadly referred to as 
combinatorial chemistry. Combinatorial techniques are particularly suited for defining 
binding interactions between molecules and for isolating molecules that have a specific 
binding activity, often called aptamers when the macromolecules are nucleic acids. 

193. There are a number of methods for isolating proteins which either have de 
novo activity or a modified activity. For example, phage display libraries have been used to 
isolate numerous peptides that interact with a specific target. (See for example, United 
States Patent No. 6,031,071; 5,824,520; 5,596,079; and 5,565,332 which are herein 
incorporated by reference at least for their material related to phage display and methods 
relate to combinatorial chemistry) 

194. A preferred method for isolating proteins that have a given function is 
described by Roberts and Szostak (Roberts R. W. and Szostak J.W. Proc. Natl. Acad. Sci. 
USA, 94(23)12997-302 (1997). This combinatorial chemistry method couples the 
functional power of proteins and the genetic power of nucleic acids. An RNA molecule is 
generated in which a puromycin molecule is covalently attached to the 3 '-end of the RNA 
molecule. An in vitro translation of this modified RNA molecule causes the correct protein, 
encoded by the RNA to be translated. In addition, because of the attachment of the 
puromycin, a peptdyl acceptor which cannot be extended, the growing peptide chain is 
attached to the puromycin which is attached to the RNA. Thus, the protein molecule is 
attached to the genetic material that encodes it. Normal in vitro selection procedures can 
now be done to isolate functional peptides. Once the selection procedure for peptide 
function is complete traditional nucleic acid manipulation procedures are performed to 
amplify the nucleic acid that codes for the selected functional peptides. After amplification 
of the genetic material, new RNA is transcribed with puromycin at the 3'-end, new peptide 
is translated and another functional round of selection is performed. Thus, protein selection 
can be performed in an iterative manner just like nucleic acid selection techniques. The 
peptide which is translated is controlled by the sequence of the RNA attached to the 
puromycin. This sequence can be anything from a random sequence engineered for 
optimum translation (i.e. no stop codons etc.) or it can be a degenerate sequence of a known 
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RNA molecule to look for improved or altered function of a known peptide. The conditions 
for nucleic acid amplification and in vitro translation are well known to those of ordinary 
skill in the art and are preferably performed as in Roberts and Szostak (Roberts R.W. and 
Szostak J.W. Proc. Natl. Acad. Sci. USA, 94(23)12997-302 (1997)). 
5 195. Another preferred method for combinatorial methods designed to isolate 

peptides is described in Cohen et al. (Cohen B.A.,et al., Proc. Natl. Acad. Sci. USA 
95(24): 14272-7 (1998)). This method utilizes and modifies two-hybrid technology. Yeast 
two-hybrid systems are useful for the detection and analysis of protein :protein interactions. 
The two-hybrid system, initially described in the yeast Saccharomyces cerevisiae, is a 
10 powerful molecular genetic technique for identifying new regulatory molecules, specific to 
the protein of interest (Fields and Song, Nature 340:245-6 (1989)). Cohen et al., modified 
this technology so that interactions between synthetic or engineered peptide sequences could 
be identified which bind a molecule of choice. The benefit of this type of technology is that 
the selection is done in an intracellular environment. 
15 1 96. Using methodology well known to those of skill in the art, in combination 

with various combinatorial libraries, one can isolate and characterize those small molecules 
or macromolecules, which bind to or interact with the desired target. The relative binding 
affinity of these compounds can be compared and optimum compounds identified using 
competitive binding studies, which are well known to those of skill in the art. 
20 1 97. Techniques for making combinatorial libraries and screening combinatorial 

libraries to isolate molecules which bind a desired target are well known to those of skill in 
the art. Representative techniques and methods can be found in but are not limited to 
United States patents 5,084,824, 5,288,514, 5,449,754, 5,506,337, 5,539,083, 5,545,568, 
5,556,762, 5,565,324, 5,565,332, 5,573,905, 5,618,825, 5,619,680, 5,627,210, 5,646,285, 
25 5,663,046, 5,670,326, 5,677,195, 5,683,899, 5,688,696, 5,688,997, 5,698,685, 5,712,146, 
5,721,099, 5,723,598, 5,741,713, 5,792,431, 5,807,683, 5,807,754, 5,821,130, 5,831,014, 
5,834,195, 5,834,318, 5,834,588, 5,840,500, 5,847,150, 5,856,107, 5,856,496, 5,859,190, 
5,864,010, 5,874,443, 5,877,214, 5,880,972, 5,886,126, 5,886,127, 5,891,737, 5,916,899, 
5,919,955, 5,925,527, 5,939,268, 5,942,387, 5,945,070, 5,948,696, 5,958,702, 5,958,792, 
30 5,962,337, 5,965,719, 5,972,719, 5,976,894, 5,980,704, 5,985,356, 5,999,086, 6,001,579, 
6,004,617, 6,008,321, 6,017,768, 6,025,371, 6,030,917, 6,040,193, 6,045,671, 6,045,755, 
6,060,596, and 6,061,636. 
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198. Combinatorial libraries can be made from a wide array of molecules using a 
number of different synthetic techniques. For example, libraries containing fused 2,4- 
pyrimidinediones (United States patent 6,025,371) dihydrobenzopyrans (United States 
Patent 6,017,768and 5,821,130), amide alcohols (United States Patent 5,976,894), hydroxy- 
5 amino acid amides (United States Patent 5,972,719) carbohydrates (United States patent 
5,965,719), l,4-benzodiazepin-2,5-diones (United States patent 5,962,337), cyclics (United 
States patent 5,958,792), biaryl amino acid amides (United States patent 5,948,696), 
thiophenes (United States patent 5,942,387), tricyclic Tetrahydroquinolines (United States 
patent 5,925,527), benzofurans (United States patent 5,919,955), isoquinolines (United 
10 States patent 5,916,899), hydantoin and thiohydantoin (United States patent 5,859,190), 
indoles (United States patent 5,856,496), imidazol-pyrido-indole and imidazol-pyrido- 
benzothiophenes (United States patent 5,856,107) substituted 2-methylene-2, 3- 
dihydrothiazoles (United States patent 5,847,150), quinolines (United States patent 
5,840,500), PNA (United States patent 5,831,014), containing tags (United States patent 
15 5,721,099), polyketides (United States patent 5,712,146), morpholino-subunits (United 
States patent 5,698,685 and 5,506,337), sulfamides (United States patent 5,61 8,825), and 
benzodiazepines (United States patent 5,288,514). 

199. Screening molecules similar to (SID) 2 -CDC-(SID)2 for inhibition of hormone 
response element containing genes is a method of isolating desired compounds. 
20 200. Molecules isolated which inhibit hormone response element containing genes 

can either be competitive inhibitors or non-competitive inhibitors. 
17. Compositions with similar functions 
201. It is understood that the compositions disclosed herein have certain 
functions, such as transcription modulation or binding DNA binding sites. Disclosed herein 
25 are certain structural requirements for performing the disclosed functions, and it is 

understood that there are a variety of structures which can perform the same function which 
are related to the disclosed structures, and that these structures will ultimately achieve the 
same result, for example stimulation or inhibition of gene expression of a gene under 
control of a hormone response element. 
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D. Methods of using the compositions 

1. Methods of using the compositions as research tools 

202. The disclosed compositions can be used in a variety of ways as research 
tools. For example, the disclosed compositions, such as SEQ ID NOs:23-40 can be used to 

5 study the interactions between hormone receptors and gene expression, by for example 
acting as inhibitors of binding. 

203. Specifically disclosed are methods of identifying a gene that is under 
transcriptional control of a hormone response element comprising contacting a cell 
containing the hormone response element with the compositions disclosed herein and 

10 monitoring the cell for changes in the transcription. 

204. The compositions can be used for example as targets in combinatorial 
chemistry protocols or other screening protocols to isolate molecules that possess desired 
functional properties related to blocking hormone receptor induced gene expression. 

205. The disclosed compositions can also be used diagnostic tools related to 
15 diseases such as cancer. 

206. The disclosed compositions can be used to identify genes that are regulated 
by a particular receptor. For example, linked modulators that are designed from Estrogen 
receptor or the Progesterone receptor, can be made. Then various cell extracts, and systems 
can be used, to identify genes that are expressed in the presence of the constitutive 

20 activators, for example. The genes can be identified in any way desired, for example, by gel 
chromatography, sequencing, and differential display, or through the use of gene arrays, 
chips, or libraries. 

2. Methods of gene modification and gene disruption 

207. The disclosed compositions and methods can be used for targeted gene 

25 disruption and modification in any animal that can undergo these events. Gene modification 
and gene disruption refer to the methods, techniques, and compositions that surround the 
selective removal or alteration of a gene or stretch of chromosome in an animal, such as a 
mammal, in a way that propagates the modification through the germ line of the mammal. 
In general, a cell is transformed with a vector which is designed to homologously recombine 

30 with a region of a particular chromosome contained within the cell, as for example, 

described herein. This homologous recombination event can produce a chromosome which 
has exogenous DNA introduced, for example in frame, with the surrounding DNA. This 
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type of protocol allows for very specific mutations, such as point mutations, to be 
introduced into the genome contained within the cell. Methods for performing this type of 
homologous recombination are disclosed herein. 

208. Thus specifically disclosed and contemplated herein are nucleic acids 

5 encoding the disclosed compositions. For example specifically disclosed are nucleic acids 
encoding compositions comprising the formula W-Z-X-Y, wherein W comprises a first 
DNA binding domain, X comprises a second DNA binding domain, Z comprises a linker, 
and Y comprises a transcription modulator, and wherein the composition binds a DNA 
binding site. Also disclosed are vectors comprising nucleic acids encoding the compositions 
10 disclosed herein. 

209. One of the preferred characteristics of performing homologous 
recombination in mammalian cells is that the cells should be able to be cultured, because the 
desired recombination event occur at a low frequency. Therefore, also disclosed are cells 
comprising the disclosed vectors or nucleic acids. 

15 210. Once the cell is produced through the methods described herein, an animal 

can be produced from this cell through either stem cell technology or cloning technology. 
For example, if the cell into which the nucleic acid was transfected was a stem cell for the 
organism, then this cell, after transfection and culturing, can be used to produce an organism 
which will contain the gene modification or disruption in germ line cells, which can then in 

20 turn be used to produce another animal that possesses the gene modification or disruption in 
all of its cells. In other methods for production of an animal containing the gene 
modification or disruption in all of its cells, cloning technologies can be used. These 
technologies generally take the nucleus of the transfected cell and either through fusion or 
replacement fuse the transfected nucleus with an oocyte which can then be manipulated to 

25 produce an animal. The advantage of procedures that use cloning instead of ES technology 
is that cells other than ES cells can be transfected. For example, a fibroblast cell, which is 
very easy to culture can be used as the cell which is transfected and has a gene modification 
or disruption event take place, and then cells derived from this cell can be used to clone a 
whole animal. It is therefore understood that the present invention discloses animals 

30 comprising cells which comprise nucleic acids encoding for the disclosed compositions. 
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3. Method of treating cancer 

211. The disclosed compositions can be used to treat any disease where 
uncontrolled cellular proliferation occurs such as cancers. A non-limiting list of different 
types of cancers is as follows: lymphomas (Hodgkins and non-Hodgkins), leukemias, 

5 carcinomas, carcinomas of solid tissues, squamous cell carcinomas, adenocarcinomas, 
sarcomas, gliomas, high grade gliomas, blastomas, neuroblastomas, plasmacytomas, 
histiocytomas, melanomas, adenomas, hypoxic tumours, myelomas, AJDS-related 
lymphomas or sarcomas, metastatic cancers, or cancers in general. 

212. A representative but non-limiting list of cancers that the disclosed 

10 compositions can be used to treat is the following: lymphoma, B cell lymphoma, T cell 

lymphoma, mycosis fungoides, Hodgkin's Disease, myeloid leukemia, bladder cancer, brain 
cancer, nervous system cancer, head and neck cancer, squamous cell carcinoma of head and 
neck, kidney cancer, lung cancers such as small cell lung cancer and non-small cell lung 
cancer, neuroblastoma/glioblastoma, ovarian cancer, pancreatic cancer, prostate cancer, skin 

15 cancer, liver cancer, melanoma, squamous cell carcinomas of the mouth, throat, larynx, and 
lung, colon cancer, cervical cancer, cervical carcinoma, breast cancer, and epithelial cancer, 
renal cancer, genitourinary cancer, pulmonary cancer, esophageal carcinoma, head and neck 
carcinoma, large bowel cancer, hematopoietic cancers; testicular cancer; colon and rectal 
cancers, prostatic cancer, or pancreatic cancer. 

20 213. Compounds disclosed herein may also be used for the treatment of precancer 

conditions such as cervical and anal dysplasias, other dysplasias, severe dysplasias, 
hyperplasias, atypical hyperplasias, and neoplasias. 

214. Therefore, specifically disclosed are methods of treating cancer in a subject 
comprising administering to the subject a composition comprising the formula W-X-Y, 

25 wherein W comprises a first DNA binding domain, X comprises a second DNA binding 
domain and Y comprises a transcription modulator, and wherein the composition binds a 
DNA binding site. Treatment can involve regulating uncontrolled cellular proliferation by 
acting directly on the proliferating cells or on indirectly by regulating factors inducing the 
proliferation. Thus also disclosed are methods of inhibiting the transcription of a gene 

30 comprising contacting a cell containing the gene with the composition the formula W-X-Y, 
wherein W comprises a first DNA binding domain, X comprises a second DNA binding 
domain and Y comprises a transcription modulator, wherein the composition binds a DNA 
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binding site, and wherein the transcription modulator is a repressor domain. A further 
embodiment of the disclosed methods are methods, wherein the cell is in a subject. Also 
disclosed are methods, wherein the subject has cancer. The subject can be any animal, 
including a mammal, such as a mouse, rat, hamster, rabbit, ape, chimpanzee, orangutan, and 
5 a human. 

215. As it is possible to regulate unregulated cellular proliferation through 
inhibition of hormone response element containing genes, it is also possible to regulate 
unregulated cellular proliferation through overexpression of hormone response element 
containing genes. Therefore, disclosed are methods of overexpressing a gene in a cell 

10 comprising contacting the gene with the composition the formula W-X-Y, wherein W 

comprises a first DNA binding domain, X comprises a second DNA binding domain and Y 
comprises a transcription modulator, wherein the composition binds a DNA binding site, 
and wherein the transcription modulator is an activation domain. 

216. Thus a specific embodiment of the present invention are methods of treating 
15 cancer in a subject comprising administering to the subject the composition the formula W- 

X-Y, wherein W comprises a first DNA binding domain, X comprises a second DNA 
binding domain and Y comprises a transcription modulator, wherein the composition binds 
a DNA binding site, and wherein the composition causes the destruction of cancerous cells 
through the overexpression of genes under the control of hormone response elements. 

20 217. Throughout this application, various publications are referenced. The 

disclosures of these publications in their entireties are hereby incorporated by reference into 
this application in order to more fully describe the state of the art to which this invention 
pertains. The references disclosed are also individually and specifically incorporated by 
reference herein for the material contained in them that is discussed in the sentence in which 

25 the reference is relied upon. 

218. It will be apparent to those skilled in the art that various modifications and 
variations can be made in the present invention without departing from the scope or spirit of 
the invention. Other embodiments of the invention will be apparent to those skilled in the 
art from consideration of the specification and practice of the invention disclosed herein. It 

30 is intended that the specification and examples be considered as exemplary only, with a true 
scope and spirit of the invention being indicated by the following claims. 
E. Methods of making the compositions 
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2 1 9. The compositions disclosed herein and the compositions necessary to 
perform the disclosed methods can be made using any method known to those of skill in the 
art for that particular reagent or compound unless otherwise specifically noted. 

1. Nucleic acid synthesis 

5 220. For example, the nucleic acids, such as, the oligonucleotides to be used as 

primers can be made using standard chemical synthesis methods or can be produced using 
enzymatic methods or any other known method. Such methods can range from standard 
enzymatic digestion followed by nucleotide fragment isolation (see for example, Sambrook 
et al, Molecular Cloning: A Laboratory Manual, 2nd Edition (Cold Spring Harbor 

10 Laboratory Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely synthetic 
methods, for example, by the cyanoethyl phosphoramidite method using a Milligen or 
Beckman System lPlus DNA synthesizer (for example, Model 8700 automated synthesizer 
of Milligen-Biosearch, Burlington, MA or ABI Model 380B). Synthetic methods useful for 
making oligonucleotides are also described by Ikuta et al., Ann. Rev. Biochem. 53:323-356 

15 (1984), (phosphotriester and phosphite-triester methods), and Narang et al, Methods 

Enzymol., 65:610-620 (1980), (phosphotriester method). Protein nucleic acid molecules can 
be made using known methods such as those described by Nielsen et al., Bioconjug. Chem. 
5:3-7(1994). 

2. Peptide synthesis 

20 221 . One method of producing the disclosed proteins is to link two or more 

peptides or polypeptides together by protein chemistry techniques. For example, peptides or 
polypeptides can be chemically synthesized using currently available laboratory equipment 
using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (iert -butyloxycarbonoyl) 
chemistry. (Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily 

25 appreciate that a peptide or polypeptide corresponding to the disclosed proteins, for 

example, can be synthesized by standard chemical reactions. For example, a peptide or 
polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other 
fragment of a peptide or protein can be synthesized and subsequently cleaved from the resin, 
thereby exposing a terminal group which is functionally blocked on the other fragment. By 

30 peptide condensation reactions, these two fragments can be covalently joined via a peptide 
bond at their carboxyl and amino termini, respectively, to form an antibody, or fragment 
thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H. Freeman and Co., N.Y. 
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(1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide Synthesis. 
Springer-Verlag Inc., NY (which is herein incorporated by reference at least for material 
related to peptide synthesis). Alternatively, the peptide or polypeptide is independently 
synthesized in vivo as described herein. Once isolated, these independent peptides or 
5 polypeptides may be linked to form a peptide or fragment thereof via similar peptide 
condensation reactions. 

222. For example, enzymatic ligation of cloned or synthetic peptide segments 
allow relatively short peptide fragments to be joined to produce larger peptide fragments, 
polypeptides or whole protein domains (Abrahmsen L et aL, Biochemistry, 30:415 1 (1991)). 

10 Alternatively, native chemical ligation of synthetic peptides can be utilized to synthetically 
construct large peptides or polypeptides from shorter peptide fragments. This method 
consists of a two step chemical reaction (Dawson et al. Synthesis of Proteins by Native 
Chemical Ligation. Science, 266:776-779 (1994)). The first step is the chemoselective 
reaction of an unprotected synthetic peptide-thioester with another unprotected peptide 

15 segment containing an amino-terminal Cys residue to give a thioester-linked intermediate as 
the initial covalent product. Without a change in the reaction conditions, this intermediate 
undergoes spontaneous, rapid intramolecular reaction to form a native peptide bond at the 
ligation site (Baggiolini M et al. (1992) FEBS Lett. 307:97-101; Clark-Lewis I et al., 
J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); 

20 Rajarathnam K et aL, Biochemistry 33:6623-30 (1994)). 

223. Alternatively, unprotected peptide segments are chemically linked where the 
bond formed between the peptide segments as a result of the chemical ligation is an 
unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique 
has been used to synthesize analogs of protein domains as well as large amounts of 

25 relatively pure proteins with full biological activity (deLisle Milton RC et al., Techniques in 
Protein Chemistry TV. Academic Press, New York, pp. 257-267 (1992)). 
3. Process claims for making the compositions 

224. Disclosed are processes for making the compositions as well as making the 
intermediates leading to the compositions. There are a variety of methods that can be used 

30 for making these compositions, such as synthetic chemical methods and standard molecular 
biology methods. It is understood that the methods of making these and the other disclosed 
compositions are specifically disclosed. 
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225. Disclosed are nucleic acid molecules produced by the process comprising 
linking in an operative way a nucleic acid comprising each element of the disclosed 
compositions. 

226. Disclosed are cells produced by the process of transforming the cell with any 
5 of the disclosed nucleic acids. Disclosed are cells produced by the process of transforming 

the cell with any of the non-naturally occurring disclosed nucleic acids. 

227. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the disclosed nucleic acids. Disclosed are any of the non-naturally 
occurring disclosed peptides produced by the process of expressing any of the disclosed 

10 nucleic acids. Disclosed are any of the disclosed peptides produced by the process of 
expressing any of the non-naturally disclosed nucleic acids. 

228. Disclosed are animals produced by the process of transfecting a cell within 
the animal with any of the nucleic acid molecules disclosed herein. Disclosed are animals 
produced by the process of transfecting a cell within the animal any of the nucleic acid 

15 molecules disclosed herein, wherein the animal is a mammal. Also disclosed are animals 
produced by the process of transfecting a cell within the animal any of the nucleic acid 
molecules disclosed herein, wherein the mammal is mouse, rat, rabbit, cow, sheep, pig, or 
primate. 

229. Also disclose are animals produced by the process of adding to the animal 
20 any of the cells disclosed herein. 

F. Examples 

230. The following examples are put forth so as to provide those of ordinary skill 
in the art with a complete disclosure and description of how the compounds, compositions, 
articles, devices and/or methods claimed herein are made and evaluated, and are intended to 

25 be purely exemplary of the invention and are not intended to limit the scope of what the 

inventors regard as their invention. Efforts have been made to ensure accuracy with respect 
to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be 
accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or 
is at ambient temperature, and pressure is at or near atmospheric. 

30 1. Example 1 

23 1 . Herein transacti vators were engineered that specifically target ERE to 
regulate the expression of ERE-containing genes. The modular nature of ER permitted the 
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design of a monomeric ERE binding module by co-joining two DNA binding domains with 
a hinge domain. Strong activation domains from other transcription factors were then 
integrated into this module to generate constitutively active ERE binding activators (EBAs). 
These activators potently induced the expression of only ERE-containing genes 
5 independent of ligand binding, dimerization, ER subtypes, promoter- and cell-context. 

Moreover, EBAs, as E2-ERa, differently altered the cell cycle progression in cells derived 
from breast cancer. While EBAs increased the cell population accumulated in the Gl phase 
of ER-negative MDA-MB-231 cells, they decreased the number of cells in the Gl phase in 
ER-positive MCF-7 cells. Thus, the targeted regulation of ERE-containing genes by the 
10 designer transregulators can provide an avenue to experimental biology and medicine. 

a) MATERIALS AND METHODS 

(1) Construction of ERE binding proteins and reporter 
constructs. 

232. The ERa, Flag-ERa expression and the reporter vectors were described 

15 previously (Muyan, M. et al. (2001) Mol Cell Endocrinol 182, 249-263; Yi, P. et al. (2002) 
Mol Endocrinol 16, 1810-1827; Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). The 
expression vectors bearing cDNAs for PR and RXRcc were provided by Drs. Edwin 
Milgrom and Ronald M. Evans, respectively (Misrahi, M. (1987) Biochem. Biophys. Res. 
Commun. 143 (McKenna, N. J. et al. (1999) Endocr Rev 20, 321-344), 740-748; 

20 Mangelsdorf, DJ. et al. (1990) Nature 345 (6272), 224-229) The promoter of the reporter 
vector is a TATA box or from the thymidine kinase (TK) that drives the expression of firefly 
luciferase cDNA as the reporter enzyme ((Muyan, M. et al. (2001) Mol Cell Endocrinol 
182, 249-263; Yi, P. et al. (2002) Mol Endocrinol 16, 1810-1827; Yi, P. et al. (2002) Mol 
Endocrinol 16, 674-693). For the engineering of the reporter construct bearing one ERE 

25 derived from the human pS2, oxytocin, lactoferrin gene, or the consensus PRE or RXRE 
sequence, strands of oligomers (Integrated DNA Technologies, Inc., Coralville, IA), were 
annealed and inserted into the reporter vector linearized with appropriate enzymes. In all 
test sequence oligomers, the flanking sequences that surround a test sequence are identical, 
were described previously (Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). For the 

30 engineering of ERE binding proteins, the cDNA for the C domain that encodes amino acids 
181-263 or for the CD domain encoding residues 181-301 was generated by PCR using the 
ERa cDNA as template (Green, S. et al. (1986) Cold Spring Harbor Symposia on 
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Quantitative Biology 51, 751-758). The 5 1 and 3' ends of cDNAs for both constructs contain 
multiple restriction enzyme sites for subsequent insertions of cDNAs for the activation 
domain (AD) of the herpes simplex virion protein, VP 16 and/or the p65 subunit of the 
nuclear factor kB, NFkB. The constructs were separately sub-cloned into the modified 
5 pBluescript-n KS (+) (Stratagene, La Jolla, CA) that bears sequences encoding for a 5' Flag 
epitope and a 3' 6xHis tag. For the generation of fusion CC, CCD or CDC, the appropriate 
fragments were joined for a corresponding cDNA. In the construction of the cDNA for the 
AD of VP 16, pVPl6 of the mammalian two-hybrid system (Invitrogen, Carlsbad, CA) was 
used as the template. The AD of VP 16 contains amino-acid residues 403-490. ThecDNA 

10 for the AD of p65 encompassing residues 416-550 (Moore, P. A. et al. (1993) Mol Cell Biol 
13, 1666-1674) was generated by PCR using a human testis cDNA library (Clontech 
Laboratories, Palo Alto, CA). The CDC and ERct mutants deficient in DNA binding 
contain amino acid substitutions that replace two of the zinc-coprdinating Cys of the first 
zinc-finger of the C domain at position 202 and 205 (Kumar, V. et al. (1988) Cell 55, 145- 

15 156) with His. These were constructed by overlapping PCR. In the generation of the 

mutant VP16 with reduced transactivation potential (Regier, J. et al. (1993) Proc Natl Acad 
Sci USA90, 883-887), Phe at position 442 was replaced with Pro by overlapping PCR. All 
constructs were sequenced. The generation of the pS2 and C3 promoter was described 
previously (Yi, P. et al. (2002) Mol Endocrinol 16, 1810-1827). The 337-bp fragment (-334 

20 to +3, 1 as the ATG codon) oxytocin promoter construct was generated from the genomic 
DNA of MCF-7 cells using primers based on the published sequence (Richard, S. et al. 
(1990) J Biol Chem 265, 6098-6103). In the engineering of reporter plasmid bearing Col 
promoter, a 145-bp fragment (-142 to +3, +1 as the start codon) was obtained by PCR from 
the genomic DNA of MCF-7 cells using primers based on the published sequence (Angel, P. 

25 et al. (1987) Mol Cell Biol 7, 2256-2266). Similarly, a 527-bp PCR fragment (-491 to +36, 
+1 indicates the transcription initiation site) of the RARct gene (Brand, N. J.et al. (1990) 
Nucleic Acids Res 18, 6799-6806) was generated. PCR fragments were inserted into pGL3- 
Basic that contains the firefly luciferase enzyme cDNA as the reporter enzyme. For the cell 
cycle analysis, the cDNA for the enhanced green fluorescent protein driven by the CMV 

30 promoter was excised from the pEGFP-Cl vector (Invitrogen, Carlsbad, CA) was inserted 
into the backbone of pCDNA 3.1(-) (Invitrogen) with appropriate enzyme. The cDNA for 
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ERct, CDC or an EBA was then inserted into the multiple cloning site of the resultant 
vector. 

(2) Synthesis and detection of ERE binding proteins in 
vitro. 

5 233. Transcription/translation of constructs in vitro was performed with a rabbit 

reticulocyte system (Promega, Madison, WI) as described previously (Muyan, M. et al. 
(2001) Mol Cell Endocrinol 182, 249-263). The reaction mixture, which contained an 
amino acid mixture that lacked Met, was divided into two aliquots. While one aliquot 
received 1 \i\ of [ 35 S]-Met (>1000 Ci/mmol; at 10 mCi/ml; NEN Life Sciences, Boston 

10 MA), the other received 1 jal of 1 mM unlabeled Met to a total of 25 \x\ reaction. Equal 

aliquots (5 jal) containing radiolabeled proteins were then subjected to electrophoresis on 4- 
1 8% gradient SDS-PAGE under reducing conditions. The gel was immersed in 1 M sodium 
salicylate for 30 min, dried and visualized by a Phospholmager (Molecular Dynamics, 
Sunnyvale, CA) or by an X-OMAT film (Kodak, Rochester, NY). The corresponding 

15 unlabeled aliquots were used in EMS A. For the estimation of relative DNA binding ability 
of the CDC protein we also used partially purified CDC from a bacterial expression system 
that utilizes pGEX-2TK as the expression vector (Amersham-Pharmacia Biotech, 
Piscataway, NJ). The GST-CDC fusion protein was expressed in E. coli BL21 (DE3)Lys 
cells and purified using GST Purification Modules (Amersham-Pharmacia) following in- 

20 column thrombin digestion of the fusion protein. Protein concentrations in eluates were 

estimated using a Bio-Rad Protein Assay Kit (Bio-Rad Labs, Hercules, CA). Equal aliquots 
of GST-fusion proteins were resolved by SDS-PAGE and visualized by Coomassie staining 
of the gel or a horse-radish peroxide conjugated 6xHis antibody (Amersham) by Western 
blotting. Human recombinant ERct expressed in baculovirus-infected insect cells was 

25 purchased from Panvera, Inc. (Madison, Wl). 

(3) Electrophoretic Gel mobility shift assay (EMSA). 
234. Double stranded oligomers were annealed and [ 32 P] end-labeled with a 3,000 

Ci/mmol specific activity isotope and used in EMSA assays as described (Muyan, M. et al. 
(2001) Mol Cell Endocrinol 182, 249-263; Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). 
30 One fmol end-labeled DNA was incubated with an equal amount of reaction mixture (5 

in a binding buffer in a total volume of 20 \x\ for lh. Reactions were resolved by 8% native 
PAGE. The dried gel was visualized and quantified by Phospholmager (Molecular 
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Dynamics). The affinity of CDC in comparison with ERa for the consensus ERE was 
determined by EMSA competition assay. Equal concentrations of functional proteins 
determined by the same amount of bound (shifted) radiolabeled ERE in the preceding 
EMSA were incubated with 0.05 nM of [ 32 P] end-labeled consensus ERE for 30 min. This 
5 was followed by incubation with 0 to 8 nM unlabeled consensus ERE for 30 min. Reactions 
were then resolved by 8% native PAGE. The concentration of unlabeled ERE required to 
reduce ER complex formation with the labeled ERE by 50% (IC 5 o) is proportional to the 
affinity of ER for the unlabeled ERE (Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). 

(4) Missing Nucleoside Hydroxyl Radical Assay (HRA). 
10 235. HRA was performed as described (Yi, P. et al. (2002) Mol Endocrinol 16, 

674-693). In brief, the bottom strand of the double stranded DNA containing ERE was 3' 
end-labeled and randomly cleaved by hydroxyl radical. DNA was then incubated with 
proteins on ice for 30 min. Bound and free EREs were excised from the gel, eluted, and 
equal counts were subjected to 18% sequencing gel electrophoresis. Maxam-Gilbert G- 
15 specific sequencing reactions were performed simultaneously. 

(5) Cell Culture and Transfection. 

236. Culturing and transfection of the mammalian COS-1 and CHO MDA-MB- 
231 cells were described previously (Muyan, M. et al. (2001) Mol Cell Endocrinol 182, 249- 

. 263; Yi, P. et al. (2002) Mol Endocrinol 16, 1810-1827), with the exceptions that 48 well 
20 cell culture plates were used and Trans-TT LT1 transfection reagent (Minis Corp., Madison, 
WT). Luciferase assays were performed with a Dual Luciferase Assay kit (Promega) 
according to the manufacture's recommendations. Steroid hormones and 9-cis retinoic acid 
were purchased from Sigma-Aldrich (St. Louis, MO). For transfection of MDA-MB-231 
and MCF-7 cells, the cells were incubated for three days in a phenol red-free DMEM 
25 supplemented with 10% charcoal-dextran treated fetal bovine serum (CD-FBS). Cells were 
transfected with appropriate expression vectors using Lipofectamine 2000 transfection 
reagent (Invitrogen). 

(6) Cell Cycle Analysis. 

237. MDA-MB-231 and MCF-7 cells were grown as described above. Cells were 
30 then transiently transfected with the EGFP vector bearing a cDNA for ERa, CDC or an 

EBA. Twenty- four hours after transfection, cells were collected by trypsinization, washed 
in phosphate buffered saline (PBS) and fixed with 1% paraformaldehyde in PBS for one 
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hour. Cells were permeabilized in 70% Ethanol overnight and stained with 50 |xg/ml 
propidium iodide (PI, Sigma-Aldrich) for one hour for flow cytometry analysis. Samples 
were analyzed on a FACS EPICS Elite EPS flow cytometer (Coulter Corp., Hialeah, FL). 
The percentages of cells at Gl phase were assessed among treatment groups and expressed 

5 as percent change compared to cells transfected with EGFP vector bearing no cDNA. 

b) RESULTS 

(1) Design of an ERE binding module 
238. ERE binding proteins were engineered as potent activators of ERE- 
containing genes independent of ligand, ER subtypes, dimerization, promoter- and cell-type. 

10 Proteins with various configurations were initially generated to obtain an ERE binding 

module that interacts efficiently with the consensus ERE. ER recognizes an ERE as a dimer 
with interaction primarily mediated by a dimerization interface located in the ligand binding 
domain and through a weak interaction surface in the C domain (Parker, M. G. (1998) 
Biochem Soc Symp 63, 45-50; Hall, J. M. et al. (2001) J Biol Chem 276, 36869-36872). 

1 5 Each C domain of the ER dimer makes analogous contacts with one of the inverted motife, 
resulting in a rotationally symmetric structure (Glass, C. K. (1994) Endocr Rev 15, 391-407; 
Luisi, B. F.et al. (1994) Vitam Horm 49, 1-47). Although the isolated C domain can interact 
with a half-site of the consensus ERE as a monomer, albeit with a substantially lower DNA 
binding affinity than that of the parent receptor, it requires ERB-dependent dimerization for 

20 the formation of a stable C-ERE complex (Schwabe, J. W. et al. (1993) Cell 75, 567-578; 
Nardulli, A. M. et al. (1991 J Biol Chem 266, 24070-24076; Kuntz, M. A. et al. (1997) J 
Biol Chem 272, 27949-279456). Consistent with this, a previous study indicated that 
dimerization of the C domains through an artificial linker enhances binding to ERE 
sequences (Kuntz, M. A. et al. (1997) J Biol Chem 272, 27949-279456). ERs share 97% 

25 amino-acid identity in their DBDs (Mosselman, S.et al. (1996) FEBS Letters 392, 49-53). 
This is reflected in their abilities to bind various ERE sequences with similar specificity and 
affinity by interacting with the same nucleosides (Yi, P. et al. (2002) Mol Endocrinol 16, 
674-693). Thus, the C domain of ERa was chosenas the template protein in the engineering 
of ER subtype-independent ERE binders. The engineered proteins consisted of two C 

30 domains of ERa co-joined without (CC and CCD) or with the D domain as a linker (CDC) 
by the genetic conjugation approach (Fig 1 A) (Muyan, M. et al. (2001) Mol Cell Endocrinol 
182, 249-263). The choice of the D domain as a linker was based on the following 
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observations. First, although the isolated C domain is the m inimal region for ERE binding, 
the immediate amino-terminal residues of the D domain contribute significantly to the DNA 
binding affinity by creating a stable interaction between the half-site-bound monomers 
(Green, S. et al. (1986) Cold Spring Harbor Symposia on Quantitative Biology 51, 751-758; 

5 Schwabe, J. W. et al. (1993) Cell 75, 567-578; Kumar, V. et al. (1987) Cell 51, 941-951; 
Mader, S. et al. (1993) Nucleic Acids Res 21, 1 125-1 132). Second, the D domain, providing 
flexibility between the amino- and carboxyl-termini of the receptor, also contains patches of 
amino-acids that act cooperatively for the nuclear localization of ERa (Picard, D. et al. 
(1990) Cell Regulation 1, 291-299). 

10 239. Analyses by fluorography of ERE binding proteins produced in a 

transcription/translation system in vitro indicated that the fusion cDNAs encode proteins 
with expected M r s and are synthesized at similar molar concentrations (Fig. IB). Only CDC 
comprised of two C domains co-joined by the D domain bound to the consensus ERE 
sequence in EMS A, unless an antibody-directed to the amino-terminal Flag (Fig. 1C) or the 

15 carboxyl-terminus 6xHis epitope was added into the reactions. This indicates that the 
bivalency of the antibody tethers two C-domains in a dimer configuration that allows the 
protein to bind to ERE (Kuntz, M. A. et al. (1997) J Biol Chem 272, 27949-279456). Thus, 
the covalent conjugation of two C domains brought to close proximity by the D domain was 
the optimal approach to allow CDC to bind efficiently to the consensus ERE. 

20 (2) Biochemical properties of the ERE binding module. 

240. CDC, as ERa, did not interact with the DNA fragment bearing an ERE half 
site, 5'-GGTCA-3' (Fig. ID) (Muyan, M. et al. (2001) Mol Cell Endocrinol 182, 249-263; 
Klinge, C. M. et al. (1997) Nucleic Acids Res 25, 1903-1912). Substitutions of two of zinc- 
coordinating Cys with His residues of the first zinc-finger in the C domain of ERa prevent 

25 the receptor from interacting with DNA (Kumar, V. et al. (1988) Cell 55, 145-156). CDC 
mutants with the equivalent mutation in the first (C*DC), second (CDC*) or both (C*DC*) 
C domains of the protein were also generated to address whether both DNA binding 
domains are required for CDC to bind to ERE. Although synthesized at levels comparable 
to those of the parent proteins as assessed by fluorography (Fig. IE), the DNA binding 

30 defective mutant C*DC, as other CDC mutants, and the mutant ERa*, showed no ERE 
binding irrespective of the addition of the flag antibody (Fig. IF). 
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241 . Moreover, the recombinant CDC and ERa occupied both half-sites of the 
consensus ERE by contacting with the same nucleosides (Fig. 1 G). This was revealed by 
the missing nucleoside hydroxyl radical assay (HRA) which allows the analysis of DNA- 
protein interaction at a single-nucleotide resolution (Yi, P. et al. (2002) Mol Endocrinol 16, 

5 674-693; Hayes, J. J. et al. (1989) Biochemistry 28, 9521-9527). HRA is based on the 

expectation that if a base important for binding were missing in a particular DNA molecule, 
the protein binding affinity would be adversely affected. Both CDC and ERa interacted 
with the same nucleosides of the consensus ERE with similar strengths. Thus, the monomer 
CDC binds to ERE in a manner similar to the dimer ERa. 

10 242. The competition assay revealed that recombinant CDC bound to ERE with an 

apparent Kd of 1.49 ± 0.16 nM as estimated by EMSA (Fig. 1H). Although the DNA 
binding affinity of CDC is six-fold lower than that of ERa (K* of 0.26 ± 0.03 nM), CDC 
effectively competed with ERa for binding to ERE (Fig. II). These results collectively 
indicate that CDC can serve as an effective ERE binding module for the engineering of 

15 ERE-binding transactivators. 

(3) Engineering and functional analysis of ERE binding 
activators. 

243. To generate potent regulators of ERE-containing genes, an ERE binding 
protein with activation function, designated as the ERE binding activator (EBA), was 

20 produced. The strong activation domain (AD) of the herpes simplex virion protein, VP1 6 
(Triezenberg, S. J. et al. (1988) Genes Dev 2, 718-729; Sadowski, I. et al. (1988) Nature 
335, 563-564), or the p65 subunit of the nuclear factor kB, NFkB, (Moore, P. A. et al. 
(1993) Mol Cell Biol 13, 1666-1674) as a single copy was genetically fused to the amino- 
(VP16-CDC or p65-CDC) or carboxyl-terminus (CDC-VP16 or CDC-p65) of CDC. The 

25 effectiveness of these EBAs to activate gene transcription in ER-negative African green 

monkey kidney epithelial (COS-1) cells was tested. Cells were transiently transfected with 
an expression vector without or with a cDNA for ERa or for an EBA, together with a 
reporter plasmid bearing none, one or two consensus ERE sequences, placed at the upstream 
of a simple TATA box promoter that drives the firefly luciferase cDNA. Also co- 

30 transfected were cells with a reporter plasmid bearing a cytomegalovirus (CMV) promoter 
driving the expression of renilla luciferase cDNA for monitoring transfection efficiency. 
Normalized activity from each reporter was compared to the basal activity from the reporter 
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construct bearing one ERE in response to the parent expression vector (pM 2 ) in the absence 
of ligand, which was set to one. 

244. CDC-VP1 6 (or p65-CDC), just as ERa, localized to the nucleus in the 
absence (Fig. 2 A) or presence of an ER-ligand, while the parent vector produced no 
intracellular staining. The nuclear staining is protein-type dependent, because the truncated 
ERa variant (EF) that lacks the A/B region along with the C and D domains showed both 
cytoplasmic and nuclear staining (Yi, P. et al. (2002) Mol Endocrinol 16, 674-693). 

245 . Expression of EB As had no effect on gene transcription from the TATA box 

promoter bearing no ERE. ERa increased luciferase activity two-fold in response to a 

physiological concentration, 10' 9 M, of E2 from one ERE (1 x ERE) (Fig. 2B, see also Table 

1). Similarly, CDC-VP16 (or p65-CDC, and see below) induced the.reporter enzyme 

activity about four-fold in the absence or presence of E2. However, both E2-ERa and 

CDC-VP16 (shown in the absence of E2) augmented enzyme activity synergistically from 

two EREs (2 x ERE). 

Table 1 . Effects of increasing number of ADs and of 
EREs on EBA-driven transactivations 



Construct 


1 x I 


2x 


Vecto 


1.0 


+ 


0.0 


1.1 


+ 


0.1 


ERa 


2.0 


+ 


0.3 


28. 


+ 


2.4 


VP16- 


1.8 


+ 


0.3 


8.8 


+ 


0.5 


CDC- 


3.7 


+ 


0.5 


57. 


+ 


5.0 


VP16-CDC- 


13. 


+ 


2.0 


17 


+ 


20. 


(VP16) 2 - 


9.3 


+ 


1.5 


11 


+ 


7.7 


CDC- 


27. 


+ 


4.2 


23 


+ 


24. 


p65- 


2.4 


+ 


0.4 


43. 


+ 


5.2 


CDC- 


3.9 


+ 


0.8 


79. 


+ 


5.5 


p65-CDC- 


34. 


+ 


3.9 


67 


+ 


10 


(p65) 2 -CDC 


45. 


+ 


6.2 


51 


+ 


50. 


CDC-(p65) 2 


29. 


+ 


5.0 


39 


+ 


33. 



If the total effect of ER or the activator on responses from tandem ERE sequences was 
greater than the sum of the effects observed with each element independently, the 
transcription was considered synergistic. Moreover, the magnitude of transactivation 
induced by the EBA from the tandem ERE construct was two-fold greater than that 
observed with the E2-ERa complex. CDC or VP 16 alone did not alter the transcription. 
Similarly, the ERE binding deficient mutant C*DC-VP16 or the CDC-VP16* variant with 
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an impaired activation function due to a point mutation in the AD of VP 16 (Cress, W. D. et 
al. (1991) Science 251, 87-90) had a dramatically reduced activation potency. Collectively, 
these results indicate that the ability of the EBA to induce gene transcription from an ERE is 
dependent on the structural integrity of the DNA binding domains of CDC and on the 
5 presence of an AD. Thus, the CDC with an AD can furnish the necessary scaffold for the 
engineering of potent EBAs. 

(4) Effects of increasing the number of a single AD. 

246. Since an EBA with one copy of an AD augmented reporter activity 
synergistically from tandem ERE sequences (ERE-dependent synergy), it was asked whether 

10 an EBA with two copies of an AD induces transacti vation from one ERE at levels observed 
from two EREs. An EBA with a single copy of an AD fused to both termini, or with two 
copies of an AD placed at the amino or carboxyl-terminus was generated and tested its 
ability to induce gene transcription from one ERE in transfected COS-1 cells. Activators 
with one AD of VP16 or of p65 at each terminus (VP 1 6-CDC-VP 1 6 or p65-CDC-p65) or 

15 two copies at either terminus [(VP16) 2 -CDC or CDC-(VP16) 2 , (p65) 2 -CDC or CDC-(p65) 2 ] 
enhanced the reporter activity dramatically compared to those with a single AD (Table 1). 
These results indicate that in addition to the synergy observed with the tandem ERE 
sequences, the presence of two ADs fused as single copies to both domains or as tandem 
copies to either domain can also synergize from one ERE (domain-dependent synergy). 

20 This is reminiscent of the synergistic activation observed between the AF-1 and AF-2 

domains of ERa which show minimal activation potential when they are expressed alone 
(Yi, P. et al. (2002) Mol Endocrinol 16, 1810-1827; Benecke, A. et al. (2000) EMBO 
Reports 1, 151-157; Kraus, W. L. et al. (1995) Proc Natl Acad Sci USA 92, 12314-12318). 
Interestingly, CDC-(VP16) 2 had higher transcription potency than that observed with VP 16- 

25 CDC-VP16 or (VP16VCDC. This result indicates that the AD of VP16 displays a position 
dependency in inducing transcription, being more effective when it is placed at the carboxyl- 
terminus of CDC. This is similar to observations with the zinc-finger-based activators with 
a single AD of VP-16 (Zhang, L. et al. (2000) J Biol Chem 275, 33850-33860; Liu, P. et 
al.(2001) J Biol Chem 276, 11323-1 1334). This was also the case for EBAs with the AD of 

30 p65. While CDC-(p65) 2 and p65-CDC-p65 showed similar potencies, (p65) 2 -CDC was the 
most potent activator. Remarkably, EBAs with two ADs positioned at either terminus or 
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both termini also retained their abilities to induce transcription synergistically from tandem 
EREs (Tablel). 

247. Thus, an increase in the number of ADs further enhances the biopotency of 
anEBA. 

5 248 . Increases in the copy number of an AD further augmented the biopotency of 

the activator. An EBA bearing three [VP16-CDC-(VP16)2 or (VP16)2-CDC- VP16] or four 
copies [(VP16) 2 -CDC-(VP16) 2 ] of VP16-AD synergistically increased the reporter enzyme 
activity compared to EBAs with two copies of ADs of VP 16 positioned at either or both 
termini (Fig. 3). Similar results were observed with the AD of p65 conjugated EBAs (Fig. 

10 3). Thus, increasing the number of an AD provides the EBA with more transcriptional 
strength. 

(5) Effects of promoter-type on the biopotency of an EBA. 

249. It is also evident that EBAs with p65-AD are more potent activators 

1 5 compared to those bearing the VP 1 6-AD when tested with an ERE-driven TATA box 
promoter (Fig. 3). Since the promoter context can affect the ability of a transactivator to 
induce gene transcription, also tested was whether the type of an AD would differentially 
alter the ability of EBAs to induce transcription from the TK promoter without or with a 
juxtaposed consensus ERE (ERE-TK). EBAs had no effect on transcription from TK 

20 promoter bearing no ERE. EBAs, on the other hand, potently induced transcription with 
increasing magnitude as a consequence of an increase in the number of ADs. Interestingly, 
the AD of VP- 16, in contrast to findings observed with the TATA box promoter, was more 
potent than that of p65 (Fig. 3) when tested with TK promoter-driven reporter construct. 
These results indicate that the type of a promoter can affect the transcriptional strength of an 

25 AD species. 

250. The recruitment of some components of the basal transcription machinery 
that includes TFTJB, TFEED and DRIP/ARC complex by the AD of VP-16 appears to be the 
initial event for establishing a functional transcriptome (Stringer, K. F. et al. (1990) Nature 
345, 783-786; Lin, L. et al. (1991) Biochimica etBiophysica Acta 1082, 177-184; Rachez, 

30 C. et al. (1999) Nature 398, 824-828). Whereas, the ability of the AD of p65 to stably 

interact with the pi 60 family of co-factors and with p300/CBP provides the complex with 
the potent transactivation potential (Gerritsen, M. E. et al. (1997) Proc Natl Acad Sci U S A 
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94, 2927-2932; Uesugi, M. et al. (1997) Science 277, 1310-1313). It was reasoned that an 
EBA with ADs of VP 16 and p65 at either terminus can circumvent the limitations on the 
biopotency of an EBA imposed by the promoter-type through a functional integration of 
diverse co-factors/regulators in comparison to those recruited by a single AD species. 

5 Indeed, VP16-CDC-p65 or p65-CDC-VP16 activated transcription from both the TATA 
box and the TK promoter constructs bearing one ERE more potently than VP16-CDC-VP16 
(Fig. 3). p65-CDC-VP16, on the other hand, displayed more transcriptional strength 
compared to p65-CDC-p65 or VP16-CDC-p65. 

25 1 . Since an increase in diversity of ADs further augments the transcriptional 

10 potency of an activator, whether EBAs bearing ADs of both VP 16 and p65 with increasing 
numbers can generate more potent transactivators than those with a single copy of each AD 
species was investigated. Genetic conjugation of VP- 16 and/or of p65 to the amino- and/or 
carboxyl-terminus of VP16-CDC-p65 or of p65-CDC-VP16 led to a further enhancement in 
the transcriptional potencies of EBAs as a result of an increase in the number of ADs. 

15 (VP16) 2 -CDC-(p65) 2 and (p65) 2 -CDC-(VP16) 2 were the most potent activators, having 

capabilities of enhancing the reporter enzyme activity from either one (Fig. 3) or two ERE- 
driven reporter construct (Table 2) more than 100-fold compared to E2-ERa. Similarly, 
(VP16) 2 -CDC-(p65) 2 and (p65) 2 -CDC-(VP16) 2 were also more potent activators than those 
with a single AD species for inducing transcription from the TK promoter construct. Thus, 

20 an increase in diversity of ADs of EBA circumvents the promoter-context dependent 
limitations on the transcription strength of an AD. 



Table 2. Effects of the cell-type on transactlvation abilities of EBAs 

CHO MDA-MB-231 



Construct 


1 x ERE 


2 x ERE 


1 xERE 


2 x ERE 


Vector 
ER 

(VP16) 2 -CDC- 
(p65) 2 -CDC- 


1.00 + 0.00 
1.76 + 0.14 
41.6 + 1.82 
44.5 + 5.41 


1.23 + 0.12 
10.1+0.50 

162 + 9.18 

163 + 11.8 


1.00 + 0.00 

I. 76 + 0.12 

II. 8 + 3.89 
21.9 + 3.71 


1.03 + 0.08 
39.7 + 10.2 
367 + 100 
349 + 77.6 



(6) Effects of the cell-type on EBA-induced 
25 transactivation. 

252. The type and availability of a myriad of activators, co-factors/integrators and 
components of the general transcriptional machinery determine the cell-specific 
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transcriptional responses. To examine whether EBAs induce transcription more potently 
than E2-ERa independent of the cell-type, EBAs were transfected into ER-negative Chinese 
Hamster Ovary (CHO) and human breast adenocarcinoma (MDA-MB-231) cells (Table 2). 
EBAs induced the reporter enzyme activity potently in all transfected cells. Although the 
5 magnitude of transcription from one or two EREs varied, EBAs induced transcription more 
potently compared to E2-ERa in all cells. 

(7) Transcriptional responses of non-consensus EREs to 
EBAs. 

253. The human pS2 (Nunez, A. M. et al. (1989) EMBO J 8, 823-829), oxytocin 
10 (Richard, S. et al. (1990) J Biol Chem 265, 6098-6103), and lactoferrin (Teng, C. T. et al. 

(1992) Mol Endocrinol 6, 1969-1981) genes confer E2-ERa responsiveness through variant 
ERE sequences 5 '-GGTC AcggTGGC C-3 *, 5'-GGTGAcctTGACC-3' and 5*- 
GGTC AaggCGATC-3 f , respectively; variations from the consensus are underlined. To 
assess whether EBAs induce transcription potently from one non-consensus ERE sequence 
15 placed upstream of the TATA box promoter, COS-1 cells were co-transfected with (VP16) 2 - 
CDC-(p65) 2 or (p65) 2 -CDC-(VP16) 2 (Fig. 4A). Both activators strongly augmented the 
reporter enzyme activity compared to E2-ERot. Thus, EBAs also induce gene transcription 
from non-consensus ERE-driven reporter constructs. 

(8) The functional conservation of sequence recognition by 
20 EBAs. 

254. Glucocorticoid (GR), Progesterone (PR) and androgen (AR) receptors 
recognize hormone response elements (REs) having half-sites, 5-AGAACA-3', similar to 
that of the ERE (Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. 
(1995) Cell 83, 835-839). Thyroid (TR), retinoid X (RXR), zXUrans retinoic acid (RAR) 

25 and vitamin D (VD) receptors, on the other hand, interact with REs with half-sites, 5'- 
AGGTCA-3', identical to that of ERE (Freedman, L. P. (1992) Endocr Rev 13, 129-145; 
Mangelsdorf, D.J. et al. (1995) Cell 83, 835-839). These sites however are arranged in 
direct (RXRE, RARE, VDRE, TRE) or inverted (G/P/A RE) repeats with various spacing 
(Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, D.J. et al. (1995) Cell 83, 

30 835-839). ERE half-sites without spacing also produce a palindromic TRE (Freedman, L. P. 
(1992) Endocr Rev 13, 129-145; Mangelsdorf, D.J. et al. (1995) Cell 83, 835-839). The 
DNA binding regions of these hormone receptors contains two zinc finger-like modules that 
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fold to form a single functional domain. The protein dimer contacts nucleotides in adjacent 
major grooves on one face of the DNA helix containing a response element. Distinct 
residues in a region of the first zinc-finger module of the C domain, P-box, determine the 
DNA binding specificity (sequence discrimination) through a protein-DNA interaction. 
5 Whereas residues in the second zinc-finger-like module, the D-box, are involved in the 
discrimination of half-site spacing through a protein-protein interaction (dimerization) that 
influence the spacing and relative orientation of the two DNA associated monomers 
(Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. (1995) Cell 83, 
835-839). These structural constraints prevent each hormone receptor from binding 
10 promiscuously to REs by the recognition of the central spacing, sequence and arrangement 
of half-sites (Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. 
(1995) Cell 83, 835-839). 

255. If the central spacing between two ERE half-sites is a critical determinant for 
the discrimination of an ERE, EBAs should induce gene transcription from only ERE half- 

15 sites with three central nucleotides that correspond to the consensus ERE, but not from 

sequences with other spacing arrangements. To test this prediction, reporter vectors bearing 
ERE half-sites with various spacing (from 0 to 15 central nucleotides) were generated and 
examined their responsiveness to the E2-ERa complex, (p65) 2 -CDC-(VP16) 2 (Fig. 4B) or 
(VP16) 2 -CDC-(p65) 2 . Results revealed that EBAs, just as E2-ERa induced transcription 

20 from only ERE half-sites separated by three central nucleotides. 

256. Since S-AGGTCATGACCT-^ is also the consensus palindromic TRE 
(Freedman, L. P. (1992) Endocr Rev 13, 129-145; Mangelsdorf, DJ. et al. (1995) Cell 83, 
835-839), the absence of transcription induction from the reporter construct by either ERct 
or EBAs also indicates that the functional conservation of sequence discrimination of EREs 

25 by ERa is also preserved in the EBAs. Similarly, ERa or EBAs did not induce luciferase 
activity from the 5 ! - AGGTC AnAGGTC A-3 ', sequence representing the consensus RXRE. 
In transiently transfected COS-1 cells, ERa, (VP16) 2 -CDC-(p65) 2 or (p65) 2 -CDC-(VP!6) 2 
had no effect on luciferase activity from a reporter plasmid bearing one RXRE; whereas 
RXRcc in response to 10* 6 M 9-cis retinoic acid augmented the enzyme activity compared to 

30 the control (Fig. 4C). 

257. The inverted 5 , -AGAACAnnnTGTTCT-3 , sequence with three non-specific 
central nucleotides is the consensus RE for GR, AR and PR. If the functional 
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discrimination of half-site sequences were conserved, EBAs, as ERa, should be ineffective 
as transactivators of the PRE-driven reporter gene from the TATA box promoter. Indeed, in 
COS-1 cells transfected with an expression vector bearing none (V), a cDNA for PR, ERa, 
(VP16) 2 -CDC-(p65) 2 or (p65) 2 -CDC-(VPl6)2 together with a reporter plasmid bearing a 
5 single PRE, only PR enhanced the reporter enzyme activity when cells were treated with 10' 
8 M Progesterone (Fig. 4D). 

258. These results collectively show that the specificity of the functional 
discrimination of responsive elements by ERa is also conserved in EBAs. 

(9) The effects of EBAs on transcription from estrogen 
10 responsive gene promoter constructs and on the cell cycle 

progression in transfected ER-negative and -positive 
breast cancer cell lines. 

259. EBAs potently modulated transcription from heterologous promoters that 
rely on a synthetic upstream ERE sequence. However, the temporal regulation of E2 

15 responsive gene expression is the result of small changes in the combination and/or 

concentration of transcription factors that bind to specific responsive elements within the 
promoter/enhancer region of a target gene. These factors act synergistically or 
antagonistically with E2-ER that interacts with a single or multiple EREs flanked by various 
responsive elements. For example, the pS2 gene confers ER responsiveness through the 5'- 

20 GGTC AcggTGGCC-3 ' sequence. Positioned between -494 to -393 from the transcription 
initiation site, this ERE is flanked by AP-1, AP-2, CAAT and GC-Boxes to modulate the 
expression of the gene (Nunez, A. M. et al. (1989) EMBO J 8, 823-829). Similarly, C3 is 
also responsive to stimulation by E2, albeit in a cell-context dependent manner (Yi, P. et al. 
(2002) Mol Endocrinol 16, 1810-1827; Sundstrom, S. A. et al. (1990) Endocrinology 126, 

25 1449-1456; Tzukerman, M. T. et al. (1994) Mol Endocrinol 8, 21-30). The E2-ER 

mediation of the C3 gene transcription requires the presence of S'-GGTGGcccTGACC-S* 
sequence surrounded by the regulatory elements for CAAT binding protein, interferon-?, 
interleukin-6, nuclear factor kB and thyroid hormone (Tzukerman, M. T. et al. (1994) Mol 
Endocrinol 8, 21-30). The expression of human oxytocin gene confers E2 responsiveness 

30 through a variant S'-GGTGAcctTGACC-S' ERE sequence flanked by a retinoic acid 

responsive element and several AP-1, SP-1 and CAAT box regulatory elements (Richard, S. 
et al. (1990) J Biol Chem 265, 6098-6103) (Richard, S. et al. (1991) J Biol Chem 266, 
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21428-21433) (Inoue, T. et al. (1994) J Biol Chem 269, 32451-32456)). To ensure that 
EBAs activate transcription of these ERE-containing genes, promoter constructs were 
generated and tested their responsiveness to EBAs in transfected MDA-MB-23 1 cells (Fig. 
5 A). While CDC had little effect on the reporter enzyme levels, ERa, only in the presence 

5 of 10" 9 M E2, p65-CDC-VP16 and (p65) 2 -CDC-(VP16) 2 enhanced the enzyme activity from 
all reporter constructs, the extent of which was dependent upon the promoter-type. The 
DNA binding defective mutants C*DC* and p65-C*DC*-VP16 had no effect on enzyme 
levels. These results indicate that the induction of transcription is dependent upon the 
interaction of transactivators with ERE sequences. 

10 260. A number of E2 responsive genes including collagenase 1 (Col) and retinoic 

acid receptor-a (RARa) are regulated by ERE-independent, DNA-dependent ligand-ER 
signaling (Kushner, P. J. et al. (2000) J Steroid Biochem Mol Biol 74, 311-317; Safe, S. 
(2001) Vitam Horm 62, 231-252). The tethering of the activation domains of ERs to the 
jun/fos family of proteins bound to the AP-1 site in the Col promoter provides the 

15 responsiveness to E2 (Kushner, P. J. et al. (2000) J Steroid Biochem Mol Biol 74, 311-317). 
Similarly, the functional interaction of the activation domains of ERa, but not ERP, with 
the Sp-1 transcription factor bound to the GC box provides the E2 responsiveness for the 
modulation of RARa gene expression (Safe, S. (2001) Vitam Horm 62, 231-252). A 
reporter construct that contains the human Col or RARa promoter was engineered as a 

20 model for the ERE-independent and DNA-dependent signaling pathway. No effect was 
observed of ERa, CDC or an EBA in the absence or presence of E2 on the luciferase 
activity driven by the RARa gene promoter (see below) in transiently transfected MDA- 
MB-23 1 cells (Fig. 5B). On the other hand, the un-liganded ERa enhanced and the E2- 
bound ERa repressed the enzyme activity from Col promoter construct. CDC, p65-CDC- 

25 VP 1 6 or (p65) 2 -CDC-(VP 1 6) 2 had no effect on enzyme levels in the absence or presence of 
E2. These results collectively indicate that EBAs specifically target ERE-containing genes. 

26 1 . Previous studies showed that the ectopic expression of ERa or ERP inhibits 
cell proliferation (Garcia, M. et al. (1992) Proc Natl Acad Sci U S A 89, 1 1538-1 1542; 
30 Wang, M. et al. ( 1 997) Breast Cancer Res Treat 44, 1 45- 1 5 1 ; Wang, M. et al. (1 997) Breast 
Cancer Res Treat 44, 145-151) by arresting the cell cycle at the Gl/S phase transition in 
MDA-MB-23 1 cells (Lazennec, G. et al. (1999) Mol Endocrinol 13, 969-980; Lazennec, G. 
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et al. (1999) Mol Cell Endocrinol 149, 93-105; Lazennec, G. et al. (2001) Endocrinology 
142, 4120-4130). Although it is clear that the restoration of some aspects of hormone 
dependency in ER-negative breast cancer cells alters the cell proliferation, the underlying 
mechanisms remain to be elucidated. Since the mediation of cell proliferation by E2-ER 

5 could involve multiple pathways, that the activation of ERE-containing genes by EBAs 
could account, at least in part, for the repression of the cell cycle progression in MDA-MB- 
23 i cells was addressed. To accomplish this, a vector that bears two expression cassettes 
for the independent synthesis of the enhanced green fluorescent protein (EGFP) and ERct, 
CDC or an EBA was engineered. The vector was transiently transfected into MDA-MB-231 

10 cells. Cells were subjected to a fluorescence-activated cell sorting (FACS) to separate 
EGFP-positive cell population, as described (Shang, Y. et al. (2000) Cell 103, 843-852). 
The cell cycle analysis was simultaneously performed with the EGFP-positive cell 
population. It was observed that while CDC decreased, E2-ERa, p65-CDC-VP16 or (p65) 2 - 
CDC-(VP16) 2 increased cell population accumulated in the Gl phase. On the other hand, 

15 the ERE binding defective mutants, shown for p65-C*DC*-VP16, had no effect on Gl/S 
phase transition. Consistent with previous studies (Garcia, M. et al. (1992) Proc Natl Acad 
SciUS A89, 11538-11542; Wang, M. et al. (1997) Breast Cancer Res Treat 44, 145-151; 
Lazennec, G. et al. (1999) Mol Endocrinol 13, 969-980; Lazennec, G. et al. (1999) Mol Cell 
Endocrinol 149, 93-105; Lazennec, G. et al. (2001) Endocrinology 142, 4120-4130; Wang, 

20 W. et al. (1997) Mol Cell Endocrinol 133, 49-62), these results further implicate ERE- 
containing genes in cell proliferation. The opposite effect of CDC compared to those of 
EBAs and E2-ERa, on the Gl/S phase transition also indicates a different mechanism of 
action on cell proliferation. 

262. An effective ERE targeting by EBAs was also observed in the ER-positive, 

25 hence hormone responsive, MCF-7 cells derived from human breast adenocarcinoma. 

MCF-7 cells were grown for three days in a medium that contains charcoal-dextran treated 
FBS to remove E2 and hence to minimize its effects on endogenous ER. In transiently 
transfected MCF-7 cells, the 10" 9 M E2 treatment enhanced reporter enzyme activity from 
the pS2, C3 and Oxytocin promoter constructs (Fig. 6A). Similarly, p65-CDC-VP16 or 

30 (p65) 2 -CDC-(VP16)2 potently increased luciferase activity in the absence or presence of 10' 9 
M E2 from all constructs. CDC repressed both the basal and E2-induced transcription. As 
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observed in MDA-MB-231 cells, DNA binding defective mutants did not alter enzyme 
levels in the absence or presence of E2. 

263. CDC or EBAs did not alter the transcriptional response from the RARa or 
Col promoter whether or not the cells were treated with 10" 9 M E2, whereas the E2 

5 treatment enhanced the reporter activity from the RARa, but not the Col, promoter (Fig. 
6B). 

264. In contrast to MDA-MB-23 1 cells, the 10' 9 M E2 treatment, as shown 
previously (Lazennec, G. et al. (1999) Mol Endocrinol 13, 969-980; Sutherland, R. L. et al. 
(1983) Eur J Cancer Clin Oncol 19, 307-318; Osborne, C K. etal. (1984) Cancer Res 44, 

10 1433-1439; Zajchowski, D. A. et al. (1993) Cancer Res 53, 5004-501 1; Foster, J. S. et al. 
(2001) Trends Endocrinol Metab 12, 320-327), and EBAs enhanced the Gl/S phase 
transition in MCF-7 cells (Fig. 6C). CDC, on the other hand, repressed the cell cycle 
progression as assessed by an increase in the cell population accumulated in the Gl phase. 
CDC effectively competed with ERa for binding to ERE in vitro (Fig. II) and repressed the 

15 E2-induced transactivation in situ (Fig. 6 A). It is therefore likely that the CDC-mediated 
suppression of the cell cycle progression in the absence of exogenously added E2 is due to 
the prevention of the ERE-bound ER function activated by trace amounts of E2 and/or other 
serum components in the culture medium. To address this issue, MCF-7 cells were cultured 
in a medium that contains untreated FBS. Cells were then transiently transfected with CDC 

20 or C*DC*. CDC dramatically increased the number of cells accumulated in the Gl phase, 
whereas C*DC* had no effect. Similarly, the treatment of cells with 10" 7 M 4- 
hydroxytamoxifen (4-OHT), an anti-estrogenic compound in ER-positive breast cancer cells 
(Jensen, E. V. (1996) Annals of the New York Academy of Sciences 784, 1-17; Jordan, V. C. 
et al. (1999) Endocr Rev 20, 253-278; McDonnell, D. P. et al. (1 999) Trends Endocrinol 

25 Metab 1 0, 301-3 1 1), increased the cell population in the Gl phase. These results are 

consistent with the prediction that the ability of CDC to compete with ERa for an ERE is 
responsible for the repressive effect of the protein on the cell cycle progression mediated by 
the-E2-ER complex. Thus, these findings further confirm the conclusion that the ERE- 
containing gene network is involved in cell cycle regulation, and indicate that EBAs 

30 specifically target ERE sequences of the estrogen responsive genes even in the presence of 
an endogenous E2-ER signaling. 

c) DISCUSSION 
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265. The modular nature of ER allowed us to initially engineer a monomeric ERE 
binding protein by co-joining two C domains with the D domain. The integration of 
increasing numbers of VP-16 and/or of p65 ADs into this module generated constitutively 
active EBAs. These transactivators specifically targeted ERE sequences and robustly 

5 activated ERE-containing genes. The activation potentials of these proteins were 
independent of ligand, ER-subtype, promoter- and cell-context. 

266. E2 is important for the growth and development of the mammary tissue 
(Jensen, E. V. (1996) Annals of the New York Academy of Sciences 784, 1-17; Jordan, V. C. 
et al. (1999) Endocr Rev 20, 253-278; Russo, J. et al. et al. (2000) J Natl Cancer Inst 

10 Monogr, 17-37). E2 is also involved in the proliferation and differentiation of ER-positive 
mammary carcinoma cells (Jensen, E. V. (1996) Annals of the New York Academy of 
Sciences 784, 1-17; Jordan, V. C. et al. (1999) Endocr Rev 20, 253-278; Russo, J. et al. et al. 
(2000) J Natl Cancer Inst Monogr, 17-37). A complex array of convergent and divergent 
mechanisms appear to be involved in the mediation of E2-ER signaling at both genomic and 

15 non-genomic levels (Parker, M. G. (1998) Biochem Soc Symp 63, 45-50; McKenna, N. J. et 
al. (1999) Endocr Rev 20, 321-344; Hall, J. M. et al. (2001) J Biol Chem 276, 36869- 
36872). The genomic effect of E2-ER signaling is mediated through ERE-dependent and - 
independent pathways. These events are responsible for the expression of a number of 
genes involved in the cellular metabolism, mitogenesis, morphogenesis, motogenesis and 

20 apoptosis (Charpentier, A. H. et al. (2000) Cancer Res 60, 5977-5983; Rey, J . M.et al. 

(2000) J Mol Endocrinol 24, 433-440; Soulez, M. et al. (2001) J Mol Endocrinol 27, 259- 
274; Gruvberger, S. et al. (2001) Cancer Res 61, 5979-5984; Inoue, A. et al. (2002) J Mol 
Endocrinol 29, 175-192). However, only a few genes identified to date are shown to be 
mediated by a direct ER-ERE interaction (Klinge, C. M. (2001) Nucleic Acids Res 29, 

25 2905-2919). The ERE binding activators that target only ERE sequences altered the Gl/S 
phase transition in both ER-negative MD A-MB-23 1 and ER-positive MCF-7 cells in a 
manner similar to the E2-ERa complex. Consistent with previous studies (Garcia, M. et al. 
(1992) Proc Natl Acad Sci U S A 89, 1 1538-11542; Wang, M. et al. (1997) Breast Cancer 
Res Treat 44, 145-151; Lazennec, G. et al. (1999) Mol Endocrinol 13, 969-980; Lazennec, 

30 G. et al. (1999) Mol Cell Endocrinol 149, 93-105; Lazennec, G. et al. (2001) Endocrinology 
142, 4120-4130; Wang, W. et al. (1997) Mol Cell Endocrinol 133, 49-62), the results 
disclosed herein further indicate the critical importance of ERE-containing genes in the 
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regulation of cell proliferation. The implications are that these transactivators can be 
utilized in the identification of only ERE-containing E2 responsive genes independent of 
ligand, ER status and subtype, promoter- and cell-context. This allows for better 
understanding of the differences in the mechanisms of genomic responses, which present a 
5 remarkable phenotypic diversity, among breast tumors. This understanding can aid the 
development of strategies for breast cancer treatment. The suppression of ERE-containing 
genes, for example, by the designer ERE-binding repressors can constitute a therapeutic 
approach for ER-positive tumors. Whereas, the activation of the ERE-containing gene 
network by EBAs can provide an effective strategy for the treatment of ER-negative 
10 neoplasms. 

267. The understanding that estrogens are involved in the initiation and 
development of estrogen target-tissue cancers has led to the development of compounds 
with diverse pharmacology that specifically target ER, hence are ER-ligands (Jordan, V. C. 
et al. (1999) Endocr Rev 20, 253-278; McDonnell, D. P. et al. (1999) Trends Endocrinol 

15 Metab 1 0, 301 -3 1 1). ER-ligands display agonistic and/or antagonistic properties depending 
upon the ER subtype, promoter- and cell-context. The binding of an agonist to ER induces 
conformational changes that allow the complex to interact with EREs. The agonist- and 
ERE-bound ER recruits co-regulatory proteins which promote remodeling of chromatin and 
forming a transcription complex. In contrast, antagonists induce conformational changes in 

20 ER that differ from those induced by E2. This prevents ER from interacting with co- 
activators, instead may promote interaction with co-repressors (Shang, Y. et al. (2000) Cell 
103, 843-852; Lavinsky, R. M. et al. (1998) Proc Natl Acad Sci U S A 95, 2920-2925), and 
subsequently inhibits the ability of ER to induce transcription (Jensen, E. V. (1996) Annals 
of the New York Academy of Sciences 784, 1-17; Jordan, V. C. et al. (1999) Endocr Rev 20, 

25 253-278; McDonnell, D. P. et al. (1999) Trends Endocrinol Metab 10, 301-31 1). This 
constitutes the molecular basis of anti-estrogen therapy. However, the eventual 
circumvention of the beneficial effects of ligand-based treatments together with the 
unintended development of other estrogen target tissue malignancies has been a major 
concern for endocrine approaches in the treatment of breast cancer. 

30 268. In addition to ER-ligands, the co-expression of variant ERs with dominant- 

negative phenotype has been proposed to be an effective modality in the functional 
inactivation of wild-type (WT)-ERs and subsequent inhibition of tumor growth (Muyan, M. 
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et al. (2001) Mol Cell Endocrinol 182, 249-263),(Elliston, J. F. et al. (1990) J Biol Chem 
265, 11517-11521; Wang, Y. et al. (1991) Mol Endocrinol 5, 1707-1715; Ince, B. A. et al. 
S. (1993) J Biol Chem 268, 14026-14032; Schodin, D. J. et al. (1995) J Biol Chem 270, 
31 163-31 171; Ma, Z. Q. et al. (1999) J Steroid Biochem Mol Biol 69, 155-163). Although 
5 multiple pathways are involved, the extent of heterodimerization of a variant with the WT- 
ER is thought to correlate with the strength of functional interference (Ince, B. A. et al. S. 
(1993) J Biol Chem 268, 14026-14032; Schodin, D. J. et al. (1995) J Biol Chem 270, 
31163-31 171; Ma, Z. Q. et al. (1999) J Steroid Biochem Mol Biol 69, 155-163). It is 
expected that the heterodimerization leads to the formation of transcriptionally inactive 

10 complexes (Ince, B. A. et al. S. (1993) J Biol Chem 268, 14026-14032; Schodin, D. J. et al. 
(1995) J Biol Chem 270, 31 163-31 171; Ma, Z. Q. et al. (1999) J Steroid Biochem Mol Biol 
69, 155-163). However, heterodimerization also provides characteristics to ER by 
combining two distinct functional properties of the contributing partners (Muyan, M. et al. 
(2001) Mol Cell Endocrinol 182, 249-263). This combinatorial diversity, hence 

15 heterogeneity, of ERs is an obstacle for the use of variants as an effective means of 
regulating estrogen signaling. 

269. The ERE binding proteins disclosed herein preclude the dependency on 
ligand for function and also the limitation of monomer association into biologically active 
dimers. This generates a homogenous population of monomelic ERE binding proteins with 

20 constitutive activity. Moreover, it was observed that the potent activities of EBAs to 

regulate the transcription of ERE-containing genes were preserved in various cell types even 
in the presence of a functional E2-ER signaling, as observed in MCF-7 cells. This presents 
an opportunity for the development of an inducible system that controls the expression of 
ERE binding proteins with desired biopotency at any target cell. This can serve as a basis to 

25 alter the growth of breast tumors independent of ER status. Furthermore, in contrast to 
antiestrogen therapy, such constructs can be targeted to specific tissues of interest by gene 
therapy approaches. 

270. One of the most common features of transcription factors is the Cys2-His2 
zinc-finger (C2H2ZF) domain that functions as the DNA binding module. Each C2H2ZF of a 

30 transfactor recognizes a three-nucleotide site with relative independence from adjacent 

nucleotides. Based on this DNA sequence recognition property, a covalent conjugation of 
various synthetic C2H2ZF components in tandem produces a binding module with 
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remarkable specificity for a pre-determined asymmetric DNA sequence (Pabo, C. O. et al. 
(2001) Annu Rev Biochem 70, 3 13-340; Beerli, R. R. et al. (2002) Nat Biotechnol 20, 135- 
141). Genetic joining of an effector (activation or repression domain) to a C2H2ZF module 
has generated transcription factors for the regulation of expression of specific genes (Zhang, 
5 L. et al. (2000) J Biol Chem 275, 33850-33860; Liu, P. et al.(2001) J Biol Chem 276, 

1 1323-1 1334; Beerli, R. R. et al. (2002) Nat Biotechnol 20, 135-141; Kim, J. S. et al. (1997) 
Proc Natl Acad Sci U S A 94, 3616-3620; Kang, J. S. et al. (2000) J Biol Chem 275, 8742- 
8748). C2H2ZF based transregulators are powerful tools for improving the understanding of 
a specific gene function through transcriptional activation/repression. They are also 

10 valuable for gene therapy wherein the potentiation or attenuation of a single gene function 
can have therapeutic benefits. However, the requirement for a predetermined DNA 
sequence necessitates the engineering of a C2H 2 ZF-transregulator for each distinct sequence. 
This limits the identification of genes driven by variant regulatory sequences that are 
responsive to the same transcription factor. Here, advantage was taken of the intrinsic 

15 specificity and flexibility of DNA sequence decoding properties of the C domain of ERa 
and generated EBAs with gradient biopotencies that specifically and robustly activate 
various ERE sequence-containing genes. This provides a means to simultaneously regulate 
multiple genes responsible for a phenotype, as in breast cancer. This approach can be 
extended to members of multi-subunit complexes of the steroid/thyroid hormone receptor 

20 superfamily and non-ligand dependent transcription factors, such as jun/fos family, which 
operate primarily as homo- and/or hetero-dimers. Targeted regulation of endogenous genes 
by specific responsive element binders with activator or repressor functions can provide 
alternative/additional paradigms to experimental biology and medicine. 

271 . EBAs induce gene transcription synergistically. An increase in the number 

25 and/or diversity of ADs allowed an EBA to induce synergy from a single ERE, hence the 
AD-dependent synergy. Also found was that an EBA retained its ability to induce synergy 
from tandem ERE sequences compared to a single ERE, the ERE-dependent synergy. 
Transcription is a dynamic event in which sequential cycles of association and dissociation 
of co-factor/mediator complexes regulate transcription of target genes through 

30 acetylation/de-acetylation processes and subsequent modifications of local chromatin 
architecture (Chen, H. W. et al. (1999) Cell. 98, 675-686). The synergy mechanism can 
involve an increase in the local concentrations of a co-factor/mediator that interacts with 
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multiple copies of a single AD species bearing EBA bound to an ERE, or diverse co-factors 
associated with an EBA with different ADs. Co-regulators with different histone modifying 
activities show different preferences for free histones and distinct targets within the histone 
substrates (Strahl, B. D. et al. (2000) Nature 403, 41-45). An increase in the concentration 
5 of a co-factor(s) can form a scaffold necessary for an mter-molecular interaction that 
stabilizes the transcription complex at the promoter or can modify local chromatin 
architecture extensively. Either or both events can lead to an extended transcriptional 
initiation and re-initiation and subsequent synergy. Similar mechanisms can also be 
responsible for the synergy observed with tandem ERE sequences. The capacity of the EBA 
10 proteins to activate gene transcription with a gradient as a result of an increase in the 
number and/or diversity of ADs allows mechanisms of transcription in general and of 
transcriptional synergy by defined components in experimental systems to be addressed. 
2. Example 2 

272. Expression of estrogen responsive genes in a spacio-temporal manner is the 

1 5 result of small changes in the combination and/or concentration of transcription factors that 
bind to specific response elements within a promoter/enhancer region of the target gene. 
These factors act synergistically or antagonistically with the E2-ER complex that interacts 
with a single or multiple estrogen responsive elements (EREs) among binding sites for other 
transcription factors. For example, the expression of Cathepsin D (CatD), a lysosomal 

20 proteinase, is induced by estrogen in mammary cancer cells through a non-consensus ERE 
surrounded by several AP-1, Spl and MLPE response elements in the proximal enhancer 
region of the gene (Augereau P. et al. Mol Endocr 8:693-703; Cavailles V. et al. (1993) Proc 
Natl Acad Sci U S A 90:203-7). ERE sequences and the other response elements acting in 
concert are required for the estrogen responsiveness (Augereau P. et al. Mol Endocr 8:693- 

25 703; Cavailles V. et al. (1993) Proc Natl Acad Sci U S A 90:203-7). 

273. In addition to non-hormonal transcriptional elements, presence of both ERE 
and Progesterone response element (PRE) within the proximal promoter of the same gene 
can permit the convergent as well as divergent regulation by E2 and Progesterone (Kraus 
W.L. et al. (1993) Endocrinology 132:2371-9; Katzenellenbogen BS (2000) J Soc Gynecol 

30 Investig 7:S33-7). While ERE and PRE act synergistically to enhance the transcription of 
vitellogenin gene n, PRE in the promoter region of the human c-fos gene antagonizes the 
activity of ERE. Thus, although the E 2 -ER-ERE complex plays a critical role in the 
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modulation of estrogen responsive genes as an enhancer, combinatorial effects of many 
regulatory components specific to each gene ultimately determine the timing, direction and 
magnitude of expression. This contrasts to experimental systems wherein the transcriptional 
activity of a reporter gene is solely dependent on ER as a transcription regulator. Therefore, 
5 an altered state of different signaling pathways converging onto an estrogen-responsive 
gene, as in CatD and C3, can effectively circumvent the ERE-modulated transcription. 
Since both anti-estrogens and dominant-negative receptor variants target ER-mediated 
signaling, aberrant signaling pathways converging onto ER and/or ER-mediated events can 
circumvent the regulatory potential of the ligand-ER. This, in turn, can contribute to tumor 

10 progression and acquisition of resistance to anti-estrogen. This is also consistent with recent 
reports that a dominant negative ERct that bears a potent suppressor does not alter the basal 
transcriptional activity of the endogenous estrogen responsive genes but it decreases E2- 
augmented expression, presumably as a result of heterodimerization with the WT receptor 
(de Haan G. (2000) J Biol Chem 275:13493-501). 

15 274. Since the ERE binding protein serves as an effective platform for the 

generation of potent molecular "on" switches for ERE-driven genes independent of ligand, 
dimerization, ER subtype, promoter- and cell-context, it was reasoned that CDC can also be 
used to generate potent "off 1 switches for the expression of these genes. This would provide 
an active repression of the responsive genes in contrast to the prevention of ER-mediated 

20 transactivation by antiestrogenic compounds. 

275. The acetylation-state of chromosomal histones correlates with transcriptional 
status. Hyperacetylated regions of chromatin frequently contain active transcription units, 
while hypoacetylated chromatin is transcriptionally silent. The enzymatic activities of both 
histone acetyl transferases (HATs) and histone deacetylases (HDACs) ultimately 

25 responsible for the relative levels of histone acetylation, hence the extent and direction of 
gene transcription (Struhl K (1998) Genes & Dev 12:599-606; Strahl, B. D. et al. (2000) 
Nature 403,41-45). 

276. The Kruppel associated box (KRAB) domain is an amino acid sequence 
motif found at the amino-terminus of nearly one-third of all Kruppel/TFIIIA-type C2H2 zinc 

30 finger proteins (Bellefroid EJ. et al. (1991) Proc Natl Acad Sci USA 88:3608-12). This 
highly conserved domain displays potent, DNA-binding dependent repression of 
transcription that requires the KRAB-associated protein- 1 (KAP-1) as the co-repressor 



— 104 — 



WO 2005/033291 



PCT7US2004/032561 



(Margolin JF et al. (1994) Proc Natl Acad Sci USA 91 :4509-13; Friedman JR et al. (1996) 
Genes Dev 10:2067-78; Moosmann P. et al. (1996) Nucleic Acids Res 24:4859-67; 
Moosmann P. et al. (1997) Biol Chem 378:669-77). The KAP-1 co-repressor in turn 
recruits the wwcleosome remodeling histone rfeacetylase complex (NuRD). The NuRD 
5 complex is composed of three functionally distinct proteins, Mi-2, MTA-2 and MBD3 that 
assemble through multiple /wter-molecular interactions to form the complex. Mi-2 
possesses an ATP-dependent chromatin remodeling activity and anchors the NuRD complex 
to the core HDAC1 and HDAC2 complexes that deacetylate histones. The methyl-DNA- 
binding protein MBD3, on the other hand, appears to mediate the ability of NuRD to repress 

10 the transcription in a methylation-dependent manner. The subsequent interaction of the 
NuRD-HADC complex with HP1, a family of nonhistone heterochromatin-associated 
proteins, results in the formation of a heterochromatin-like complex, leading to gene 
silencing. All members of the KRAB domains repress transcription when tethered to DNA. 
Fusions of the KRAB domain derived from KOX1/ZNF10 zinc finger protein to the DNA 

15 binding domains of both LexA and Gal4 factors, for example, effectively silence the 

transcription of reporter genes in transfected cells regardless of the distance of the cognate 
binding element to the transcription initiation site. In addition to the recruitment of co- 
repressor complexes by the fusion proteins to the promoter, specific inhibition of some 
component(s) of RNA polymerase H and m transcription by KRAB appears also involved in 

20 transcription suppression (Moosmann P. et al. (1996) Nucleic Acids Res 24:4859-67; 
Moosmann P. et al. (1997) Biol Chem 378:669-77). 

277. Transcriptional repression by the basic helix-loop-helix zipper protein Madl 
requires DNA binding as a ternary complex with its heterodimer partner Max and the 
mammalian co-repressor protein Sin3. Similarly, the repression of thyroid hormone 

25 responsive genes is mediated by the recruitment of Sin3 repressor complex by the 

unliganded TR-corepressor N-CoR/SMRT complexes. The Sin3 complex contains two 
additional proteins, SAP 18 and SAP30. These proteins are integral components of the Sin3 
complex. They appear involved in the facilitation or stabilization of co-repressor 
interactions with Sin3 and HDACs. The interaction between Madl and Sin3 is mediated by 

30 a 35 amino-acid region in the amino-terminus of Madl (Sin3 interaction domain, or SID) 
(Kasten MM. et al. (1996) Mol Cell Biol 16:4215-21; Ayer DE et al. (1996) Mol Cell Biol 
16:5772-81). The tethering of SID, just as KRAB, to a DBD of a transcription factor 
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suppresses gene transcription mediated by the response element to which the parent 
transcription factor binds. As NuRD complex, Sin3 or SID-mediated transcriptional 
repression involves HDAC1 and HDAC2 recruitment to the promoter. Thus, although 
distinct set of transcriptional repressors recruits different co-repressor complexes to 
5 establish a represosome, the recruitment of HADC1 and HDAC2 by NuRD or Sin3 complex 
is critical for the active gene repression (Kasten MM. et al. (1996) Mol Cell Biol 1 6:4215- 
21; Ayer DE et al. (1996) Mol Cell Biol 16:5772-81). 

278. Since the tethering of a single or multiple copies of the same or different 
ADs to CDC generated EBAs, it was reasoned that fusion of KRAB and/or SID as a single 
10 or multiple copies to CDC should produce ERE-binding repressors (EBRs) that effectively 
suppress transcription of estrogen responsive genes. 

a) RESULTS AND DISCUSSION 

(1) Engineering and functional analysis of ERE binding 
repressors. 

1 5 279. To engineer potent repressors of ERE containing genes, the cDNA of the 

repressor domain KRAB domain (RD) of the KOX-1 protein (Aronica, S. M. et al. (1993) 
Mol Endocrinol 7, 743-752) to CDC was generated. This KRAB cDNA encompasses 
residues 1-90 and was produced by PCR using primers based on the published cDNA 
sequence of KOX1 (Thiesen HJ (1990) New Biol 2:363-74) and a human testis cDNA 

20 library (Clontech Laboratories, Palo Alto, CA) as the template. Similarly, the cDNA for 
SID that encodes residues 1-35 of Madl (Ayer DE et al. (1996) Mol Cell Biol 16:5772-81) 
was generated by PCR from the testis cDNA library. The cDNA for KRAB or SID as a 
single copy was genetically fused to the amino- (KRAB-CDC or SID-CDC) or carboxyl- 
terminus (CDC-KRAB or CDC-SID) of CDC to generate EBRs. 

25 280. The effectiveness of EBRs to suppress transcription was then assessed in ER- 

negative COS-1 or CHO cells. Cells were transiently transfected with an expression vector 
without or with a cDNA for ERa or for an EBR, together with a reporter plasmid bearing 
none or one consensus ERE sequence juxtaposed to the complex SV40 promoter that drives 
the expression of firefly luciferase cDNA as the reporter. Cells were also transfected with a 

30 plasmid bearing a cytomegalovirus (CMV) promoter driving the expression of the renilla 
luciferase cDNA for monitoring transfection efficiency. Normalized activity from each 
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reporter was compared to the basal activity from the reporter construct bearing no ERE in 
response to the parent expression vector (V) in the absence of ligand, which was set to one. 

281. Results revealed that expression of ERct or EBRs had no effect on gene 
transcription from the SV40 promoter without an ERE. Similarly ERa had little effect on 
5 the enzyme activity in the absence or presence of 10" 9 M E2 or 10" 6 M 4-OHT from the 
SV40 promoter with one juxtaposed ERE. On the other hand, CDC unexpectedly 
suppressed basal transcription about 50% in COS-1 cells whether or not cells were treated 
with E2 or 4-OHT. This contrast to CHO cells in which CDC had little effect on enzyme 
activity in the absence or presence of E2 or 4-OHT. 

10 282. EBRs with one RD specie at either terminus dramatically repressed the 

reporter enzyme activity in COS-1 and CHO cells. While KRAB-CDC or SID-CDC 
decreased the SV40-induced transcription about 50% of the basal transcription, the relative 
repression reached to 80% of the basal enzyme activity with CDC-KRAB or CDC-SID. A 
stronger transcription repression mediated by KRAB or SID positioned at the carboxyl- 

15 terminus of CDC was observed with KRAB or SID placed at the ammo-terminus. This 
indicates that RDs are more effective in repressing transcription when placed at the 
carboxyl-terminus of CDC. These together with the observations that EBR had no effect on 
the reporter gene expression from SV-40 promoter bearing no ERE indicate that the ability 
of an EBR to repress transcription is dependent upon binding to an ERE and the presence of 

20 an RD. Thus, as with the development of molecular "on" switches, the genetic fusion of a 
RD to the ERE binder CDC is also a feasible paradigm in engineering of "off 1 switches for 
the ERE-containing genes. 

(2) Effects of increasing number and/or diversity of RD on 
the ability of an EBR to repress transcription in various 
25 cells 

283. Since an EBR with a single copy of a RD placed at either terminus repressed 
the transcription about 80% of that observed with the parent vector, it was asked whether an 
increase in the copy number of a RD further augments the biopotency of the repressor. To 
address this point, the cDNA of a RD was fused to both termini. KRAB-CDC-KRAB or 
30 SID-CDC-SID further suppressed the reporter enzyme activity compared to an EBR bearing 
single RD at either domain. Thus, increasing the number of a RD provides the EBA with 
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more transcriptional strength. Interestingly, EBRs bearing KRAB at either or both termini 
are more potent repressor than those with the SID. 

284. The initial events in the establishment of a functional repressosome by 
KRAB and SID involve the recruitment of distinct sets of co-repressor complexes. 
5 Integration of these complexes recruited can provide an EBR with stronger transcription- 
repression potency compared to those with single RD specie. Indeed, KRAB-CDC-SDD or 
SID-CDC-KRAB repressed the transcription more potently compared to that observed with 
KRAB-CDC-KRAB or SID-CDC-SID; the repression by either construct reached to 95% of 
that observed in response to the parent vector. 
10 285. Similarly, EBRs effectively repressed transcription of the reporter gene in 

ER-negative CHO cells. Thus, EBRs with diverse RDs can be used as potent repressors of 
ERE-containing genes regardless of ligand, ER subtype and cell-context. 

(3) Effects of EBRs on the expression of estrogen 
responsive genes. 

15 286. EBRs are potent repressors of transcription from a strong heterologous 

promoter, which rely on an upstream consensus ERE sequence. However, most of the 
estrogen responsive genes bear a single or multiple ERE sequences that deviate from the 
consensus with one or more nucleotides. Moreover, as mentioned above, the direction and 
magnitude of estrogen responsive gene expression are dictated not only by the E2-ER 

20 complex but also by the integrated actions of trans-acting proteins. The type and 
availability of these protein components specific to each gene act cooperatively or 
antagonistically with the E2-ER complex to modulate the gene expression. To ensure that 
EBRs repress transcription of estrogen responsive genes as well, reporter constructs were 
engineered bearing the enhancer/promoter region of estrogen responsive gene CatD or C3 

25 that derives the expression of the luciferase cDNA were generated. The expression of CatD 
is augmented by estrogens through a non-consensus ERE core (S'-GGCCGgcTGACC-S', 
deviations are underlined) located between -271 to -259 from the transcription initiation 
site. This ERE is flanked by several AP-1, Spl and MLPE response elements (Augereau P. 
et al. Mol Endocr 8:693-703). The ERE sequence is shown to be required for the estrogen 

30 responsiveness, albeit modestly (0.6-0.9 fold compared to the basal expression) (Augereau 
P. et al. Mol Endocr 8:693-703; Miralles F. et al. Biochem Biophy Res Comm 203:71 1-8). 
Similarly, C3 is also responsive to stimulation by both estrogens and tamoxifen, or its more 
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active analog 4-hydroxy-tamoxifen (4-OHT) (Sundstrom, S. A. et al. (1990) Endocrinology 
126, 1449-1456; Tzukerman, M. T. et al. (1994) Mol Endocrinol 8, 21-30). Tamoxifen and 
4-OHT are the widely used antiestrogenic compounds in the treatment of breast cancer. The 
E2 mediation of transcription requires the presence of a non-consensus ERE, 5- 
5 GGTGGcccTGACC-3' surrounded by regulatory elements for CAAT binding protein, 
interferone-y, interleukin-6, nuclear factor kB, glucocorticoid and thyroid hormones 
(Tzukerman, M. T. et al. (1994) Mol Endocrinol 8, 21-30). 

287. The CatD and C3 promoters were generated by PGR using pCD3542 
(Augereau P. et al. Mol Endocr 8:693-703) and pC3Tl (Fan ID. (1996) Mol Endocrinol 

10 10:1605-16) reporter plasmids as templates, respectively. A fragment of 730 bp [-750 to - 
22, +1 is referred to the first ATG, (Parker, M. G. (1998) Biochem Soc Symp 63, 45-50)] of 
the promoter region of CatD gene or 580 bp [-516 to +61, +1 as the transcription start site) 
of the C3 gene (Tzukerman, M. T. et al. (1994) Mol Endocrinol 8, 21-30)] was generated. 
PCR fragments were then inserted into the reporter plasmid pGL3-Basic linearized with 

1 5 appropriate restriction enzymes. 

288. COS-1 cells were transfected with a reporter construct bearing CatD or C3 
promoter together with the expression vector bearing none (V, as control), cDNA for ERa 
or an EBR. Following a transfection, cells were treated without or with 10" 9 M E2 or 10" 7 M 
4-OHT for 24h. 

20 289. Results revealed that ERa had no effect on the enzyme activity in the 

absence of ligand, while the E2-ERa complex enhanced the transcription, albeit modestly, 
compared to the control. 4-OHT displays also partial agonistic effect on ERa-mediated 
transcriptional responses in a promoter-dependent manner. The 4-OHT partial-agonism is 
manifested as transcriptional responses that are substantially lower than those observed with 

25 E2. 4-OHT did not alter the transcription of the reporter enzyme mediated by ERa, while 4- 
OHT at the same concentration prevented the E2-ERa complex-mediated augmentation on 
the enzyme activity. CDC alone had no effect on enzyme activity whether or not cells were 
treated with ligands. In contrast to effects observed on the S V-40 promoter bearing one 
ERE, KRAB at either or both termini of CDC did not alter the transcription of the reporter 

30 enzyme compared to the control levels. On the other hand, CDC with a SID fused to either 
terminus or both termini decreased the transcription about 60% of that observed with the 
parent vector. There was no further decrease in the enzyme levels in response the EBR with 
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two RDs (KRAB-CDC-SID or SID-KRAB). This indicates that an EBR with SID, but not 
KRAB, fused to both domains is an effective repressor of the CatD promoter-mediated 
transcription. 

290. When transcriptional responses were tested in transfected cells with C3 

5 promoter construct, it was observed that ERa had increased the enzyme activity about two- 
fold compared to the parent vector control (V) in the absence of ligand, while the E2-ERa 
complex enhanced the transcription about 1 1-fold. The 4-OHT-ERa complex also 
augmented the reporter enzyme about three-fold. This contrasted to the response observed 
with the CatD promoter, whose activity was not altered by the 4-OHT bound ERa. 4-OHT 

10 on the other hand, effectively decreased the transcription induction from the C3 promoter 
mediated by the E2-ERa complex. As observed with the CatD promoter, KRAB fused to 
either terminus or both termini of CDC had no effect on the reporter enzyme activity in the 
absence or presence of ligand. Whereas, SID as a single copy fused to either terminus or 
both termini of CDC repressed the reporter enzyme activity substantially. Interestingly, 

15 EBR bearing both RDs as a single copy at either terminus (KRAB-CDC-SID or SID-CDC- 
KRAB) had similar repression biopotency compared to that observed with SID-CDC-SID. 

291 . These results confirm the contention that antiestrogenic compounds target 
solely ER-mediated transcriptional events without altering the basal transcription and 
indicate that EBRs can be used as potent repressors of ERE-containing genes regardless of 

20 ligand and ER-subtype, promoter- and cell-type. 

(4) Infectivity of model cell lines with recombinant 
adenovirus ' 

292. To assess the infectivity of the model cell lines by adenovirus, initially an 
adenovirus that expresses the (3-galactosidase cDNA. Viral plaques were isolated and 

25 propagated to produce lysate containing infectious recombinant virus. The recombinant 
a denoviru s at different multiplicities of infection (MOI) was introduced, (number of virus 
per cell whose titer had been determined with a plaque assay (plaque forming unit, pfii/ml)), 
for 24h into ER-negative MDA-MB-231 cells and ER-positive MCF-7 cells. Cells were 
then-fixed with 1% glutaraldehyde in PBS and in situ P-galactosidase assay was performed 

30 as described (Lim, K. et al. (1 989) BioTechniques 7, 576-579). Results revealed that both 
cell-lines were infected with high efficiency, albeit at different MOIs (Fig. 10). MDA-MB- 
231 cells were infected with 100% efficiency by the recombinant adenovirus at 250 MOI, 
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whereas the MOI for MCF-7 cells was 500, Kinetic studies in both cell lines further showed 
that the color development, as an indication of the p-galactosidase synthesis, reaches 
maximum levels at 18h following infection with an optimal MOL 

293. Assessing the infectivity of LCC cell lines by adenovirus can be performed to 
5 obtain optimal MOIs for subsequent usage. 

(5) Testing the effects of regulatory proteins on the 
expression of endogenous E2 responsive genes 

294. An effective gene expression/delivery to breast cancer cells allows one to test 
the prediction that transregulatory proteins specifically and potently regulate the ERE- 

10 containing E2 -responsive genes in native chromatin context. Among the breast cancer cell 
lines, MDA-MB-231 cells have been widely used as a model for the regulation of the 
Catepsin D (CatD) and c-myc gene expression (Lazennec, G. et al. (2001) Endocrinology 
142, 4120-4130; Lazennec, G. et al. (1999) Mol. Cell. Endocrinol. 149, 93-105; Jeng, M. H. 
et al. (1993) Molecular & Cellular Endocrinology 97, 1 15-123; Jeng, M. H. et al. (1994) 

15 Cancer Lett. 82, 123-128). The E2-ER<x-responsiveness of the CatD gene is mediated by 
non-consensus EREs in its proximal promoter, whereas, the E2 responsiveness of the c-myc 
gene is mediated by the ERE-independent pathway. To effectively address whether EBAs 
activate specifically ERE-driven genes in cells expressing endogenous ERa, we infected 
MDA-MB-231 cells with adenovirus bearing no (CMV), or p65-CDC-VP16 (PV), cDNA 

20 (Fig. 1 1). In infected cells, both E2-ERa and PV enhanced the transcription of the ERE- 
driven CatD gene expression compared to the parent adenovirus bearing no cDNA (CMV) 
about 3-fold, while both constructs had no effect on the house keeping gene p-actin mRNA 
levels, as expected. On the other hand, only E2-ERa, but not P V, repressed mRNA levels 
of c-Myc, whose gene expression is regulated by the DNA-dependent and ERE-independent 

25 pathway, 3 -fold compared to the parent adenovirus. These results are consistent with the 
expectation that ERE-binding transregulators specifically target ERE-driven endogenous 
genes. Importantly, as observed previously (Huang, J. et al. (2004) Mol. Cell. Endocrinol. 
218, 65-78), PV, as the E2 treatment, dramatically decreased cell population accumulated in 
Gl phase, shown are cell cycle histograms in Fig. 12. Thus, PV regulates cell cycle 

30 proliferation in a manner similar to the E2 treatment in adenovirus infected MDA-MB-231 
cells. Importantly, the infection with adenovirus bearing ERE binding defective P*V, had 
no effect on cell cycle progression (data not shown). Consistent with results disclosed 
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herein, these results collectively show that specific targeting of ERE-driven genes is 
necessary and sufficient to alter cell proliferation in cell lines that emulate de novo 
endocrine resistant breast neoplasms. These results also establish the adenovirus gene 
delivery approach as an effective means in cell culture systems. Thus, these findings 
5 provide a basis for the rational that targeted regulation of ERE-driven genes provides a 
means for the treatment of breast cancers. 

295. These results also indicate the feasibility of testing the effectiveness of 
transregulators delivered by recombinant adenoviruses in altering experimental tumor 
growth in animal as translational experimental models. 

10 (6) Tissue -specific expression of transgene delivered by 

adenovirus 

296. The whey acidic protein (WAP) promoter is known to be active in pregnant 
and lactating mammary epithelial cells as well as mammary tumors of mice (Ozturk- 
Winder, R et al. (2002) Cancer Gene Ther. 9, 421-431). Studies have indicated that a 

15 proximal 405 base-pair (bp) fragment of the murine WAP promoter confers mammary- 
specific expression upon a marker gene in transgenic mice (Ozturk- Winder, F. et al. (2002) 
Cancer Gene Ther. 9, 421-431). These observations together with the findings that infection 
of a number of established human mammary and nonmammary cell lines with a retroviral 
vector bearing WAP promoter limit the transgene expression to mammary tumor cell lines 

20 indicate that the WAP promoter is active in human mammary cells and mammary tumor 

cells. These results also imply that the extended proximal WAP promoter can be useful for 
directing therapeutic gene expression to mammary tumors (Ozturk- Winder, F. et al. (2002) 
Cancer Gene Ther. 9, 421-431). 

297. To accomplish breast tissue-specific delivery of genes encoding the disclosed 
25 transregulatory proteins by recombinant adenoviruses, one can exchange the adenovirus 

promoter CMV that derives the expression of the transgene cDNA with the 405 bp promoter 
of the WAP gene using molecular biology approaches we have described (Yi, P. et al. 
(2002) Mol. Endocrinol. 16, 1810-1827; Muyan, M. et al. (2001) Mol. Cell. Endocrinol. 
182, 249-263; Sathya, G. et al. (2002) Mol. Cell. Endocrinol. 192, 171-185). One can then 
30 generate the modified adenoviruses bearing our novel transregulators to infect various cell 
lines derived from breast tissue as well as non-breast tissues. Transregulatory proteins are 
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only expressed in cell lines derived from breast tissue and in experimental breast tumors in 
animal models. 

298. Throughout this application, various publications are referenced. The 
disclosures of these publications in their entireties are hereby incorporated by reference into 
5 this application in order to more fully describe the state of the art to which this pertains. 
The references disclosed are also individually and specifically incorporated by reference 
herein for the material contained in them that is discussed in the sentence in which the 
reference is relied upon. 
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H. Sequences 

I. SEQ ID NO: 1 

AGAACA 

2. SEQ ID NO: 2 
5 AGGTCA 

3. SEQ ID NO: 3 
AGAACAnnnTGTTCT 

4. SEQ ID NO: 4 
AGGTCAnAGGTCA 

10 5. SEQ ID NO: 5 
AGGTCATGACCT 

6. SEQ ID NO: 6 human pS2 gene ERE 

GGTCAcggTGGCC 

7. SEQ ID NO: 7 human oxytocin gene ERE 
15 GGTGAcctTGACC 

8. SEQ ID NO: 8 human lactoferrin gene ERE 
GGTCAaggCGATC 

9. SEQ ID NO: 9 
GGTCAnnnTGACC 

20 10. SEQ ID NO: 10 human angitensinogen gene ERE 
GGGCAtcgTGACC 

11. SEQ ID NO: 1 1 human BCL-2 gene ERE 

GGTCGccaGGACC 

12. SEQ ID NO: 12 human BRCA-1 gene ERE 
25 GGTCAggcTGGTC 

13. SEQ ID NO: 13 human Calbinding-D9 gene ERE 

GGCCGggcTGACC 

14. SEQ ID NO: 14 human cathepsin D gene ERE 

GGCCGggcTGACC 
30 15. SEQ ID NO: 15 human choline acetyltransferase gene ERE 

GGCCAcgaTGACA 

16. SEQ ID NO: 16 human complement-3 gene ERE 
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GGTGGcccTGACC 

17. SEQ ID NO: 17 human cytochorome c oxidase subunit Vila-related protein 
(COX7RP) gene ERE 

GGTCAaggTGACC 

5 18. SEQ ID NO: 18 human estrogen responsive finger protein gene ERE 

GGTCAtggTGACC 

19. SEQ ID NO: 19 human Progesterone receptor gene ERE 
GCAGGagcTGACC Progesterone 

20. SEQ ID NO: 20 human pS2 gene ERE 
10 GGTCAgcgTGGCC 

21. SEQ ID NO: 21 human VEGF gene ERE 
AATCAgacTGACT 

22. SEQ ID NO: 22 human Alu gene ERE 

GGTCAggcTGGTC 

15 23. SEQ ID NO: 23 ERa: (Nucleotide Sequence Accession Number: M12674); DNA 
Binding Domain Amino Acid Sequence 

TRYCAVC^YASGYHYGWSCEGCKAFFKRSIQGHNDYMCPATNQCTroKNEtRK 
SCQACRLRKCYEVGM 

20 24. SEQ ID NO: 24 ERa Hinge Domain Amino Acid Sequence 

MKGGIRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAANLWPSPLMQCRS 

25. SEQ ID NO: 25 ERp: (Nucleotide Sequence Accession Number: NM_001437): 
DNA Binding Domain Amino Acid Sequence 

25 DAHFCAVCSDYASGYHYGVWSCEGCKAFFKRSIQGHNDYICPATNQCTIDKNRRK 

SCQACRLRKCYEVGM 

26. SEQ ID NO: 26 ERp Hinge Domain Amino Acid Sequence 

VKCGSRRERCGYRLVRRQRSADEQLHCAG 

30 

27. SEQ ID NO: 27 Glucocorticoid receptor (GR): (Nucleotide Sequence Accession 
Number: Ml 0901); DNA Binding Domain Amino Acid Sequence 
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PPKLCLVCSDEASGCHYGVLTCGSCKWFKRAVEGQHNYLCAGRI^CITOKJIIRKN 
CPACRYRKCLQAGMNLEA 

28. SEQ ID NO: 28 Hinge Domain Amino Acid Sequence 

5 RKTKKOKGIQQATTGVSQETSENPGNKTIVPATLPQLTPT GR 

29. SEQ ID NO: 29 Androgen receptor (AR): (Nucleotide Sequence Accession 
Number: L29496): DNA Binding Domain Amino Acid Sequence 

PQKTCLICGDEASGCirYGALTCGSCKWFKRAAEGKQKYLCASRNDCTIDKFRRKN 
10 CPS CRLRKC YE AGMTL 

30. SEQ ID NO: 30 Domain Amino Acid Sequence 

GARKLKKLGNLKLQEEGEASSTTSPTEETTQKLTVSHIEGYECQPIFLNVLEAI AR 
Hinge 

15 

31. SEQ ED NO: 31 Progesterone Receptor (PR): (Nucleotide Sequence Accession 
Number: M15716): DNA Binding Domain Amino Acid Sequence 

PQKICLICGDEASGCHYGVLTCGSCKWFKPvAMEGQHNYLCAGRNDCIVDKIRRKN 
CPACRLRKCCQAGMVL 

20 

32. SEQ ID NO: 32 PR Hinge Domain Amino Acid Sequence 

GGRKFKKFNKWVVRALDAVAEPQPVGVPNESQALSQRETFSPGQDIQLIPPL 

33. SEQ ID NO: 33 Thyroid hormone receptor (TR): (Nucleotide Sequence 
25 Accession Number: NM_000461): DNA Binding Domain Amino Acid Sequence 

KDELCWCGDKATGYHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTR 

NQCQECRFKKCrYVGM 

34. SEQ ID NO: 34 TR Hinge Domain Amino Acid Sequence 

30 ATDLVLDDSKPaAKRIO.IEENREKRPvPJEELQK5IGHKPEPTDEEWELIKTVTEAHVA 

TNAQGS 
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35. SEQ ID NO: 35 Retinoic X Receptor (RXR): (Nucleotide Sequence Accession 
Number: NM_002957): DNA Binding Domain Amino Acid Sequence 

TKHICAICGDRSSGKHYGVYSCEGCKGFFKRTVRKDLTYTCRDNKDCLIDKRQRNR 

CQYCRYQKCLAMGM 

5 

36. SEQ H> NO: 36 RXR Hinge Domain Amino Acid Sequence 
KREAVQEERQRGKDRNENEVESTSSANEDMPVERILEAELAVEPKTETYVEANMG 

LNPS 

10 37. SEQ ID NO: 37 Retinoic Acid Receptor alpha (RARa): (Nucleotide Sequence 
Accession Number: NM_000964): DNA Binding Domain Amino Acid Sequence 

rYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQKmWYTCHRDKNCIINKVTRNR 

CQYCRLQKCFEVGMSK 

15 38. SEQ ID NO: 38 RARa Hinge Domain Amino Acid Sequence 

ESVRNDRNKKKKEVPKPECSESYTLTPEVGELffiKVRKAHQETFPALCQLGKYTTN 

NSSEQRVSLDIDL 

39. SEQ ID NO: 39 Vitamin D receptor (VDR) (Nucleotide Sequence Accession 
20 Number: AF026260): DNA Binding Domain Amino Acid Sequence 

RJCGVCGDRATGFHFT^AMTCEGCKGFFRRSMKRXALFTCPFNGDCR1TKI)NRRHC 

QACRLKRCVDIGMMK 

40. SEQ ID NO: 40 VDR Hinge Domain Amino Acid Sequence 

25 EFn.TDEEVQRKRENnLKRKEEEALKDSLPJ>KXSEEQQRI[AILLDAHHKTYDPTYSDF 

CQFRPP 



— 128 — 



WO 2005/033291 



PCT/US2004/032561 



41. SEQ ID NO: 41 Human NF-kappa-B transcription factor p65 subunit 
(Nucleotide Sequence Accession Number: LI 9067); Activation Domain Amino Acid 
Sequence 

PPQAVAPPAPKPTQAGEGTLSEALLQLQFDDEDLGALLGNSTDPAVFTDLASVDNSE 
FQQLLNQGIPVAPHTTEPMLMEYPEAJTRLVTGAQRPPDPAPAPLGAPGLPNGLLSG 

DEDFSSIADMDFSALLSQIS 

42. SEQ ID NO: 42 Cloning vector pVP16 (Nucleotide Sequence Accession 
Number: U89963); VP16 Activation Domain Amino Acid Sequence 

KVAPPTDVSLGDEIJiLDGEDVAMAHADALDDFDIX)MLGDGDSPGPGFTPHDSAPY 

GALDMADFEFEQMFTDALGIDEYGGEFPGIRR 

43. SEQ ID NO: 43 Human Madl (Nucleotide Sequence Accession Number: 
L06895); SID Repression Domain Amino Acid Sequence 

MAAAVRMNIQMLI^AADYLERREREAEHGYASMLP 

44. SEQ ID NO: 44 Human KOX-1 (Nucleotide Sequence Accession Number: 
X52332); KRAB Repression Domain Amino Acid Sequence 

MDAKSLTAWSRTLVTFKDWVDFTREEWKLLDTAQQIVYRNVMLENYKNLVSLG 
YQLTKPDVILRLEKGEEPWLVEREIHQETHPDSETA 

45. SEQ ID NO: 45 Estrogen Receptor a (ERa) amino acid sequence 

MTMTLHTKASGMALLHQIQGNELEPUWQI^ 

GAAYEFNAAAAANAQVYGQTGLPYGPGSEAAAFGSNGLGGFPPLNSVSPSPLMLL 

HPPPQLSPFLQPHGQQWYYLENEPSGYTVREAGPPAFYRPNSDNRRQGGRERLAST 

NDKGSMAMESAKETRYCAVCNDYASGYHYGVWSCEGCKAFFKRSIQGHNDYMC 

PATNQCTIDKM^RKSCQACRLRKCYEVGMMKGGIRKDRRGGRMLKHKRQRDDGE 

GRGEVGSAGDMRAANLWPSPLMIKRSBCKNSLALSLTADQMVSALLDAEPPILYSEY 

DPTPvPFSEASMMGLLTNLADRELVHMPrWAKRWGFVDLTLHDQVHLLECAWLEI 

LMI GL V WRS MEHP VKLLF APNLLLDRNQ GKC VE GM VEEFDMLLATS S RFRMMNLQ 

GEEFVCLKSHLLNSGWTI^STLKSLEEKDHIHRVLDKITDTLfflLMAKAGLTLQQQ 

HQRJLAQLLLILS HIPvHM SNKGMEHLYS MKCKNVVPLYDLLLEMLD AHRLHAPTSR 

GGASVEETDQSHLATAGSTSSHSLQKYYITGEAEGFPATV 

46. SEQ ID NO: 46 ERa nucleic acid sequence 

1 gaattccaaa attgtgatgt ttcttgtatt tttgatgaag gagaaatact gtaatgatca 
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61 ctgtttacac tatgtacact ttaggccagc cctttgtagc gttatacaaa ctgaaagcac 
121 accggacccg caggctcccg gggcagggcc ggggccagag ctcgcgtgtc ggcgggacat 
181 gcgctgcgtc gcctctaacc tcgggctgtg ctctttttcc aggtggcccg ccggtttctg 
241 agccttctgc cctgcgggga cacggtctgc accctgcccg cggccacgga ccatgaccat 

5 301 gaccctccac accaaagcat ctgggatggc cctactgcat cagatccaag ggaacgagct 

361 ggagcccctg aaccgtccgc agctcaagat ccccctggag cggcccctgg gcgaggtgta 
421 cctggacagc agcaagcccg ccgtgtacaa ctaccccgag ggcgccgcct acgagttcaa 
481 cgccgcggcc gccgccaacg cgcaggtcta cggtcagacc ggcctcccct acggccccgg 
541 gtctgaggct gcggcgttcg gctccaacgg cctggggggt ttccccccac tcaacagcgt 

10 601 gtctccgagc ccgctgatgc tactgcaccc gccgccgcag ctgtcgcctt tcctgcagcc 

661 ccacggccag caggtgccct actacctgga gaacgagccc agcggctaca cggtgcgcga 
721 ggccggcccg ccggcattct acaggccaaa ttcagataat cgacgccagg gtggcagaga 
781 aagattggcc agtaccaatg acaagggaag tatggctatg gaatctgcca aggagactcg 
841 ctactgtgca gtgtgcaatg actatgcttc aggctaccat tatggagtct ggtcctgtga 

15 901 gggctgcaag gccttcttca agagaagtat tcaaggacat aacgactata tgtgtccagc 

961 caccaaccag tgcaccattg ataaaaacag gaggaagagc tgccaggcct gccggctccg 
1021 caaatgctac gaagtgggaa tgatgaaagg tgggatacga aaagaccgaa gaggagggag 
1081 aatgttgaaa cacaagcgcc agagagatga tggggagggc aggggtgaag tggggtctgc 
1141 tggagacatg agagctgcca acctttggcc aagcccgctc atgatcaaac gctctaagaa 

20 1201 gaacagcctg gccttgtccc tgacggccga ccagatggtc agtgccttgt tggatgctga 

1261 gccccccata ctctattccg agtatgatcc taccagaccc ttcagtgaag cttcgatgat 
1321 gggcttactg accaacctgg cagacaggga gctggttcac atgatcaact gggcgaagag 
1381 ggtgccaggc tttgtggatt tgaccctcca tgatcaggtc caccttctag aatgtgcctg 
1441 gctagagatc ctgatgattg gtctcgtctg gcgctccatg gagcacccag tgaagctact 

25 1501 gtttgctcct aacttgctct tggacaggaa ccagggaaaa tgtgtagagg gcatggtgga 

1561 gatcttcgac atgctgctgg ctacatcatc tcggttccgc atgatgaatc tgcagggaga 

1 62 1 ggagtttgtg tgcctcaaat ctattatttt gcttaattct ggagtgtaca catttctgtc 

1681 cagcaccctg aagtctctgg aagagaagga ccatatccac cgagtcctgg acaagatcac 
1741 agacactttg atccacctga tggccaaggc aggcctgacc ctgcagcagc agcaccagcg 

30 1 801 gctggcccag ctcctcctca tcctctccca catcaggcac atgagtaaca aaggcatgga 

1861 gcatctgtac agcatgaagt gcaagaacgt ggtgcccctc tatgacctgc tgctggagat 
1921 gctggacgcc caccgcctac atgcgcccac tagccgtgga ggggcatccg tggaggagac 
1981 ggaccaaagc cacttggcca ctgcgggctc tacttcatcg cattccttgc aaaagtatta 
2041 catcacgggg gaggcagagg gtttccctgc cacagtctga gagctccctg gc 

35 

47. SEQ ID NO: 47 ERa DBD nucleic acid sequence 

actcg 

841 ctactgtgca gtgtgcaatg actatgcttc aggctaccat tatggagtct ggtcctgtga 
901 gggctgcaag gccttcttca agagaagtat tcaaggacat aacgactata tgtgtccagc 
40 961 caccaaccag tgcaccattg ataaaaacag gaggaagagc tgccaggcct gccggctccg 

1 02 1 caaatgctac gaagtgggaa tg 

48. SEQ ID NO: 48 ERa hinge domain nucleic acid sequence 

atgaaagg tgggatacga aaagaccgaa gaggagggag 
45 1081 aatgttgaaa cacaagcgcc agagagatga tggggagggc aggggtgaag tggggtctgc 
1 141 tggagacatg agagctgcca acctttggcc aagcccgctc atgatcaaac gctct 
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49. SEQ ID NO: 49 Estrogen Receptor p (ERp) amino acid sequence 

MDKNSPSSLNSPSSYNCSQSI^^ 
5 SNVTNLEGGPGRQTTSPNVLWPTPGHLSPLVVHRQLSHLYAEPQKSPW 

TLP VNRETLKRKV S GNRC ASP VTGPGSKRD AHFC A VCSD Y AS G YHYG VWS CEGCK 
AFFKRSIQGHNDYICPATNQCTIDKNRRKSCQACRLRKCY^ 
RLVRRQRSADEQLHCAGKAKRSGGHAPRVM 
RPSAPFTEASMMMSLTKLADKELV^ 
10 MMGLMWRSIDHPGKLIFAPDLVXDRDEGKCVEGILEIFDM 
YLCVKAMILLNSSMYPLVT^^ 
MRI^NLLMLLSHVRHASNKGMEHLLNMKC 
ITGSECSPAEDSKSKEGSQNPQSQ 

15 50. SEQ ID NO: 50 ERp nucleic acid sequence 

1 tttcagtttc tccagctgct ggctttttgg acacccactc ccccgccagg aggcagttgc 
61 aagcgcggag gctgcgagaa ataactgcct cttgaaactt gcagggcgaa gagcaggcgg 
121 cgagcgctgg gccggggagg gaccacccga gctgcgacgg gctctggggc tgcggggcag 

20 181 ggctggcgcc cggagcctga gctgcaggag gtgcgctcgc tttcctcaac aggtggcggc 

241 ggggcgcgcg ccgggagacc ccccctaatg cgggaaaagc acgtgtccgc attttagaga 
301 aggcaaggcc ggtgtgttta tctgcaagcc attatacttg cccacgaatc tttgagaaca 
361 ttataatgac ctttgtgcct cttcttgcaa ggtgttttct cagctgttat ctcaagacat 
421 ggatataaaa aactcaccat ctagccttaa ttctccttcc tcctacaact gcagtcaatc 

25 481 catcttaccc ctggagcacg gctccatata cataccttcc tcctatgtag acagccacca 

541 tgaatatcca gccatgacat tctatagccc tgctgtgatg aattacagca ttcccagcaa 
601 tgtcactaac ttggaaggtg ggcctggtcg gcagaccaca agcccaaatg tgttgtggcc 
661 aacacctggg cacctttctc ctttagtggt ccatcgccag ttatcacatc tgtatgcgga 
721 acctcaaaag agtccctggt gtgaagcaag atcgctagaa cacaccttac ctgtaaacag 

30 781 agagacactg aaaaggaagg ttagtgggaa ccgttgcgcc agccctgtta ctggtccagg 

841 ttcaaagagg gatgctcact tctgcgctgt ctgcagcgat tacgcatcgg gatatcacta 
901 tggagtctgg tcgtgtgaag gatgtaaggc cttttttaaa agaagcattc aaggacataa 
961 tgattatatt tgtccagcta caaatcagtg tacaatcgat aaaaaccggc gcaagagctg 
1021 ccaggcctgc cgacttcgga agtgttacga agtgggaatg gtgaagtgtg gctcccggag 

35 1081 agagagatgt gggtaccgcc ttgtgcggag acagagaagt gccgacgagc agctgcactg 

1141 tgccggcaag gccaagagaa gtggcggcca cgcgccccga gtgcgggagc tgctgctgga 
1201 cgccctgagc cccgagcagc tagtgctcac cctcctggag gctgagccgc cccatgtgct 
1261 gatcagccgc cccagtgcgc ccttcaccga ggcctccatg atgatgtccc tgaccaagtt 
1321 ggccgacaag gagttggtac acatgatcag ctgggccaag aagattcccg gctttgtgga 

40 1381 gctcagcctg ttcgaccaag tgcggctctt ggagagctgt tggatggagg tgttaatgat 
1441 ggggctgatg tggcgctcaa ttgaccaccc cggcaagctc atctttgctc cagatcttgt 
1 501 tctggacagg gatgagggga aatgcgtaga aggaattctg gaaatctttg acatgctcct 
1561 ggcaactact tcaaggtttc gagagttaaa actccaacac aaagaatatc tctgtgtcaa 
1621 ggccatgatc ctgctcaatt ccagtatgta ccctctggtc acagcgaccc aggatgctga 

45 1681 cagcagccgg aagctggctc acttgctgaa cgccgtgacc gatgctttgg tttgggtgat 

1741 tgccaagagc ggcatctcct cccagcagca atccatgcgc ctggctaacc tcctgatgct 
1801 cctgtcccac gtcaggcatg cgagtaacaa gggcatggaa catctgctca acatgaagtg 
1861 caaaaatgtg gtcccagtgt atgacctgct gctggagatg ctgaatgccc acgtgcttcg 
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1921 cgggtgcaag tcctccatca cggggtccga gtgcagcccg gcagaggaca gtaaaagcaa 
1981 agagggctcc cagaacccac agtctcagtg a 

5 51. SEQ ED NO: 51 ER(3 DBD nucleic acid sequence 

gatgctcact tctgcgctgt ctgcagcgat tacgcatcgg gatatcacta 
901 tggagtctgg tcgtgtgaag gatgtaaggc cttttttaaa agaagcattc aaggacataa 
961 tgattatatt tgtccagcta caaatcagtg tacaatcgat aaaaaccggc gcaagagctg 
1021 ccaggcctgc cgacttcgga agtgttacga agtgggaatg 

10 

52. SEQ ID NO: 52 ER|3 hinge domain nucleic acid sequence 

gtgaagtgtg gctcccggag 

1081 agagagatgt gggtaccgcc ttgtgcggag acagagaagt gccgacgagc agctgcactg 
1141 tgccggc 

15 

53. SEQ ID NO: 53 Glucocorticoid Receptor (GR) amino acid sequence 

MDSKESLTPGREENPSSVLAQERGDVM^ 

20 KQRRLL\HDFPKGSVSNAQQPDLSKA^ 

LSSGETDLKLLEESIANLNRSTSWENPKSSASTAVSAAPTEKEFPKTHSDVSSEQQH 
LKGQTGTNGGNVKLYTTDQSTFDILQDLEFSSGSPGKETNESPWRSDLLroE 
LAGEDDSFLLEGNSNEDCKPLDJPDTKPKIKDNGDLVLSSPSNVTLP 
LCTPGVIKQEKLGTVYCQASFPGAl^GNKMSAISVHGVSTSGGQ 

25 SQQQDQKPIFNWPIPVGSENWNRCQGSGDDNLTSLGTLNFPGRTWSNGYSSPSM 
RPDVSSPPSSSSTATTGPPPKLCLVCSDEASGCHYGVLTCGSCKWFKRAVEGQHNY 
LCAGRNDCITOKIRRKNCPACRYRKCLQAGMNLEARKTKXKK 
SENPGNKTIW ATLPQLTPTLVSLLEVIEPE VLY AGYD S S WDST WRIMTTLNMLGGR 
QVIAAVKWAKAIPGFRNLHLDDQMTLLQYSWMFLM 

30 APDLIINEQRMTLPCMYDQCKHMLYVSSELHRLQVSYEEYLCM^ 
LKSQELFDEIRMTYIKELGKAIVKREGNSSQNWQRFYQ 
CFQTFLDKTMSffiFPEMLAEnTNQIPKYSNGNIKKLLFHQK 



35 54. SEQ ID NO: 54 GR nucleic acid sequence 

1 tttttagaaa aaaaaaatat atttccctcc tgctccttct gcgttcacaa gctaagttgt 
61 ttatctcggc tgcggcggga actgcggacg gtggcgggcg agcggctcct ctgccagagt 
121 tgatattcac tgatggactc caaagaatca ttaactcctg gtagagaaga aaaccccagc 

40 181 agtgtgcttg ctcaggagag gggagatgtg atggacttct ataaaaccct aagaggagga 

241 gctactgtga aggtttctgc gtcttcaccc tcactggctg tcgcttctca atcagactcc 
301 aagcagcgaa gacttttggt tgattttcca aaaggctcag taagcaatgc gcagcagcca 
361 gatctgtcca aagcagtttc actctcaatg ggactgtata tgggagagac agaaacaaaa 
421 gtgatgggaa atgacctggg attcccacag cagggccaaa tcagcctttc ctcgggggaa 

45 481 acagacttaa agcttttgga agaaagcatt gcaaacctca ataggtcgac cagtgttcca 

541 gagaacccca agagttcagc atccactgct gtgtctgctg cccccacaga gaaggagttt 
601 ccaaaaactc actctgatgt atcttcagaa cagcaacatt tgaagggcca gactggcacc 
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661 aacggtggca atgtgaaatt gtataccaca gaccaaagca cctttgacat tttgcaggat 
721 ttggagtttt cttctgggtc cccaggtaaa gagacgaatg agagtccttg gagatcagac 
781 ctgttgatag atgaaaactg tttgctttct cctctggcgg gagaagacga ttcattcctt 
841 ttggaaggaa actcgaatga ggactgcaag cctctcattt taccggacac taaacccaaa 
5 901 attaaggata atggagatct ggttttgtca agccccagta atgtaacact gccccaagtg 

961 aaaacagaaa aagaagattt catcgaactc tgcacccctg gggtaattaa gcaagagaaa 
1021 ctgggcacag tttactgtca ggcaagcttt cctggagcaa atataattgg taataaaatg 
1081 tctgccattt ctgttcatgg tgtgagtacc tctggaggac agatgtacca ctatgacatg 
1141 aatacagcat ccctttctca acagcaggat cagaagccta tttttaatgt cattccacca 

10 1201 attcccgttg gttccgaaaa ttggaatagg tgccaaggat ctggagatga caacttgact 

1261 tctctgggga ctctgaactt ccctggtcga acagtttttt ctaatggcta ttcaagcccc 
1321 agcatgagac cagatgtaag ctctcctcca tccagctcct caacagcaac aacaggacca 
1381 cctcccaaac tctgcctggt gtgctctgat gaagcttcag gatgtcatta tggagtctta 
1441 acttgtggaa gctgtaaagt tttcttcaaa agagcagtgg aaggacagca caattaccta 

15 1501 tgtgctggaa ggaatgattg catcatcgat aaaattcgaa gaaaaaactg cccagcatgc 

1561 cgctatcgaa aatgtcttca ggctggaatg aacctggaag ctcgaaaaac aaagaaaaaa 
1621 ataaaaggaa ttcagcaggc cactacagga gtctcacaag aaacctctga aaatcctggt 
1681 aacaaaacaa tagttcctgc aacgttacca caactcaccc ctaccctggt gtcactgttg 
1741 gaggttattg aacctgaagt gttatatgca ggatatgata gctctgttcc agactcaact 

20 1801 tggaggatca tgactacgct caacatgtta ggagggcggc aagtgattgc agcagtgaaa 

1861 tgggcaaagg caataccagg tttcaggaac ttacacctgg atgaccaaat gaccctactg 
1921 cagtactcct ggatgtttct tatggcattt gctctggggt ggagatcata tagacaatca 
1981 agtgcaaacc tgctgtgttt tgctcctgat ctgattatta atgagcagag aatgactcta 
2041 ccctgcatgt acgaccaatg taaacacatg ctgtatgttt cctctgagtt acacaggctt 

25 2101 caggtatctt atgaagagta tctctgtatg aaaaccttac tgcttctctc ttcagttcct 

2161 aaggacggtc tgaagagcca agagctattt gatgaaatta gaatgaccta catcaaagag 
2221 ctaggaaaag ccattgtcaa gagggaagga aactccagcc agaactggca gcggttttat 
2281 caactgacaa aactcttgga ttctatgcat gaagtggttg aaaatctcct taactattgc 
2341 ttccaaacat ttttggataa gaccatgagt attgaattcc ccgagatgtt agctgaaatc 

30 2401 atcaccaatc agataccaaa atattcaaat ggaaatatca aaaaacttct gtttcatcaa 
2461 aagtgactgc cttaataaga atggttgcct taaagaaagt cgaattaata gcttttattg 
2521 tataaactat cagtttgtcc tgtagaggtt ttgttgtttt attttttatt gttttcatct 
2581 gttgttttgt tttaaatacg cactacatgt ggtttataga gggccaagac ttggcaacag 
2641 aagcagttga gtcgtcatca cttttcagtg atgggagagt agatggtgaa atttattagt 

35 2701 taatatatcc cagaaattag aaaccttaat atgtggacgt aatctccaca gtcaaagaag 
2761 gatggcacct aaaccaccag tgcccaaagt ctgtgtgatg aactttctct tcatactttt 
2821 tttcacagtt ggctggatga aattttctag actttctgtt ggtgtatccc ccccctgtat 
2881 agttaggata gcatttttga tttatgcatg gaaacctgaa aaaaagttta caagtgtata 
2941 tcagaaaagg gaagttgtgc cttttatagc tattactgtc tggttttaac aatttccttt 

40 3001 atatttagtg aactacgctt gctcattttt tcttacataa ttttttattc aagttattgt 

3061 acagctgttt aagatgggca gctagttcgt agctttccca aataaactct aaacattaat 
3121 caatcatctg tgtgaaaatg ggttggtgct tctaacctga tggcacttag ctatcagaag 
31 81 accacaaaaa ttgactcaaa tctccagtat tcttgtcaaa aaaaaaaaaa aaaaagctca 
3241 tattttgtat atatctgctt cagtggagaa ttatataggt tgtgcaaatt aacagtccta 

45 3301 actggtatag agcacctagt ccagtgacct gctgggtaaa ctgtggatga tggttgcaaa 

3361 agactaattt aaaaaataac taccaagagg ccctgtctgt acctaacgcc ctatttttgc 
3421 aatggctata tggcaagaaa gctggtaaac tatttgtctt tcaggacctt ttgaagtagt 
3481 ttgtataact tcttaaaagt tgtgattcca gataaccagc tgtaacacag ctgagagact 
3541 tttaatcaga caaagtaatt cctctcacta aactttaccc aaaaactaaa tctctaatat 
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3601 ggcaaaaatg gctagacacc cattttcaca ttcccatctg tcaccaattg gttaatcttt 
3661 cctgatggta caggaaagct cagctactga tttttgtgat ttagaactgt atgtcagaca 
3721 tccatgtttg taaaactaca catccctaat gtgtgccata gagtttaaca caagtcctgt 
3781 gaatttcttc actgttgaaa attattttaa acaaaataga agctgtagta gccctttctg 
3841 tgtgcacctt accaactttc tgtaaactca aaacttaaca tatttactaa gccacaagaa 
3901 atttgatttc tattcaaggt ggccaaatta tttgtgtaat agaaaactga aaatctaata 
3961 ttaaaaatat ggaacttcta atatattttt atatttagtt atagtttcag atatatatca 
4021 tattggtatt cactaatctg ggaagggaag ggctactgca gctttacatg caatttatta 
4081 aaatgattgt aaaatagctt gtatagtgta aaataagaat gatttttaga tgagattgtt 
4141 ttatcatgac atgttatata ttttttgtag gggtcaaaga aatgctgatg gataacctat 
4201 atgatttata gtttgtacat gcattcatac aggcagcgat ggtctcagaa accaaacagt 
4261 ttgctctagg ggaagaggga gatggagact ggtcctgtgt gcagtgaagg ttgctgaggc 
4321 tctgacccag tgagattaca gaggaagtta tcctctgcct cccattctga ccacccttct 
4381 cattccaaca gtgagtctgt cagcgcaggt ttagtttact caatctcccc ttgcactaaa 
4441 gtatgtaaag tatgtaaaca ggagacagga aggtggtgct tacatcctta aaggcaccat 
4501 ctaatagcgg gttactttca catacagccc tcccccagca gttgaatgac aacagaagct 
4561 tcagaagttt ggcaatagtt tgcatagagg taccagcaat atgtaaatag tgcagaatct 
4621 cataggttgc caataataca ctaattcctt tctatcctac aacaagagtt tatttccaaa 
4681 taaaatgagg acatgttttt gttttctttg aatgcttttt gaatgttatt tgttattttc 
4741 agtattttgg agaaattatt taataaaaaa acaatcattt gctttttg 

55. SEQ ID NO: 55 GR DBD nucleic acid sequence 

cctcccaaac tctgcctggt gtgctctgat gaagcttcag gatgtcatta tggagtctta 
1441 acttgtggaa gctgtaaagt tttcttcaaa agagcagtgg aaggacagca caattaccta 
1501 tgtgctggaa ggaatgattg catcatcgat aaaattcgaa gaaaaaactg cccagcatgc 
1561 cgctatcgaa aatgtcttca ggctggaatg aacctggaag ct 

56. SEQ ID NO: 56 GR hinge domain nucleic acid sequence 

cgaaaaac aaagaaaaaa 

1621 ataaaaggaa ttcagcaggc cactacagga gtctcacaag aaacctctga aaatcctggt 
1681 aacaaaacaa tagttcctgc aacgttacca caactcaccc ctacc 

57. SEQ ID NO: 57 Androgen Receptor (AR) amino acid sequence 

MQLLQQQQQEAVSEGSSSGRAREASGAPTSSKDNYLGGTSTISDNAKELCKAVSVS 

MGLGVEALEHLSPGEQLRGDCMYAPLLGWPAVRPTPCAPLAECKGSLLDDSAGKS 

TEDTAEYSPFKGGYTKGLEGESLGCSGSAAAGSSGTLELPSTLSLYKSGALDEAAAY 

QSRDYYNFPLALAGPPPPPPPPHPHARIKLENPLDYGSAWAAAAAQCRYGDLASL^ 

GAGAAGPGSGSPSAAASSSWHTLFTAEEGQLYGPCGGGGGGGGGGGGGGGGGGG 

GGGGGGGGEAEAVAPYGYTRPPQGLAGQESDFTAPDVWYPGGMVSRVPYPSPTC 

VKSEMGPWMDSYSGPYGDMRLETARDHVLPIDYYFPPQKTCLICGDEASGCHYGA 

LTCGSCKWFKRAAEGKQKYLCASRNDCT1DKFRRKNCPSCRLRKCYE 

RIO.KKLGNLKXQEEGEASSTTSPTEETTQKLTVSHffiGYECQPIFLN^ 

AGHDNNQPDSFAALLSSLNELGERQLVHVVXWAKALPGF 

WMGLMW AMG WRSFTNVNSRMLYF APDLVFNE YR^^ QC VRMRHLS QE 

FGWLQITPQEFLCMKALLLFSIIPVDGLKNQKFFDELRMN^ 
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CSRRFYQLTKLLDSVQPIAR^ 
KVKPIYFHTQ 



5 58. SEQ ID NO: 58 AR nucleic acid sequence 

1 atgcaactcc ttcagcaaca gcagcaggaa gcagtatccg aaggcagcag cagcgggaga 
61 gcgagggagg cctcgggggc tcccacttcc tccaaggaca attacttagg gggcacttcg 
121 accatttctg acaacgccaa ggagttgtgt aaggcagtgt cggtgtccat gggcctgggt 

10 181 gtggaggcgt tggagcatct gagtccaggg gaacagcttc ggggggattg catgtacgcc 

241 ccacttttgg gagttccacc cgctgtgcgt cccactcctt gtgccccatt ggccgaatgc 
301 aaaggttctc tgctagacga cagcgcaggc aagagcactg aagatactgc tgagtattcc 
361 cctttcaagg gaggttacac caaagggcta gaaggcgaga gcctaggctg ctctggcagc 
421 gctgcagcag ggagctccgg gacacttgaa ctgccgtcta ccctgtctct ctacaagtcc 

15 481 ggagcactgg acgaggcagc tgcgtaccag agtcgcgact actacaactt tccactggct 

541 ctggccggac cgccgccccc tccgccgcct ccccatcccc acgctcgcat caagctggag 
601 aacccgctgg actacggcag cgcctgggcg gctgcggcgg cgcagtgccg ctatggggac 
661 ctggcgagcc tgcatggcgc gggtgcagcg ggacccggtt ctgggtcacc ctcagccgcc 
721 gcttcctcat cctggcacac tctcttcaca gccgaagaag gccagttgta tggaccgtgt 

20 781 ggtggtggtg ggggtggtgg tggcggcggc ggcggcggcg gcggcggcgg cggcggcggc 

841 ggcggcggcg gcggcggcgg cgaggcggaa gctgtagccc cctacggcta cactcggccc 
901 cctcaggggc tggcgggcca ggaaagcgac ttcaccgcac ctgatgtgtg gtaccctggc 
961 ggcatggtga gcagagtgcc ctatcccagt cccacttgtg tcaaaagcga aatgggcccc 
1021 tggatggata gctactccgg accttacggg gacatgcgtt tggagactgc cagggaccat 

25 1081 gttttgccca ttgactatta ctttccaccc cagaagacct gcctgatctg tggagatgaa 
1141 gcttctgggt gtcactatgg agctctcaca tgtggaagct gcaaggtctt cttcaaaaga 
1201 gccgctgaag ggaaacagaa gtacctgtgc gccagcagaa atgattgcac tattgataaa 
1261 ttccgaagga aaaattgtcc atcttgtcgt cttcggaaat gttatgaagc agggatgact 
1321 ctgggagccc ggaagctgaa gaaacttggt aatctgaaac tacaggagga aggagaggct 

30 1381 tccagcacca ccagccccac tgaggagaca acccagaagc tgacagtgtc acacattgaa 
1441 ggctatgaat gtcagcccat ctttctgaat gtcctggaag ccattgagcc aggtgtagtg 
1501 tgtgctggac acgacaacaa ccagcccgac tcctttgcag ccttgctctc tagcctcaat 
1561 gaactgggag agagacagct tgtacacgtg gtcaagtggg ccaaggcctt gcctggcttc 
1621 cgcaacttac acgtggacga ccagatggct gtcattcagt actcctggat ggggctcatg 

35 1681 gtgtttgcca tgggctggcg atccttcacc aatgtcaact ccaggatgct ctacttcgcc 
1741 cctgatctgg ttttcaatga gtaccgcatg cacaagtccc ggatgtacag ccagtgtgtc 
1801 cgaatgaggc acctctctca agagtttgga tggctccaaa tcacccccca ggaattcctg 
1861 tgcatgaaag cactgctact cttcagcatt attccagtgg atgggctgaa aaatcaaaaa 
1921 ttctttgatg aacttcgaat gaactacatc aaggaactcg atcgtatcat tgcatgcaaa 

40 1981 agaaaaaatc ccacatcctg ctcaagacgc ttctaccagc tcaccaagct cctggactcc 
2041 gtgcagccta ttgcgagaga gctgcatcag ttcacttttg acctgctaat caagtcacac 
2101 atggtgagcg tggactttcc ggaaatgatg gcagagatca tctctgtgca agtgcccaag 
2161 atcctttctg ggaaagtcaa gcccatctat ttccacaccc agtga 



59. SEQ ID NO: 59 AR DBD nucleic acid sequence 

ccc cagaagacct gcctgatctg tggagatgaa 
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1141 gcttctgggt gtcactatgg agctctcaca tgtggaagct gcaaggtctt cttcaaaaga 
1201 gccgctgaag ggaaacagaa gtacctgtgc gccagcagaa atgattgcac tattgataaa 
1261 ttccgaagga aaaattgtcc atcttgtcgt cttcggaaat gttatgaagc agggatgact 
1321 ctg 

5 

60. SEQ ID NO: 60 AR hinge domain nucleic acid sequence 

ggagccc ggaagctgaa gaaacttggt aatctgaaac tacaggagga aggagaggct 
1381 tccagcacca ccagccccac tgaggagaca acccagaagc tgacagtgtc acacattgaa 
10 1441 ggctatgaat gtcagcccat ctttctgaat gtcctggaag ccatt 



61. SEQ ID NO: 61 Progesterone Receptor (PR) amino acid sequence 

15 MTELKAKGPRAPHVAGGPPSPEVGSPLLCRPAAGPFPGSQTSDTLPEVSAIPISLDGL 
LFPRPCQGQDPSDEKTQDQQSLSDVEGAYSRAEATRGAGGSSSSPPEKDSGLLDSVL 
DTLLAPSGPGQSQPSPPACEVTSSWCLFGPELPEDPPAAPATQRVLSPLMSRSGCKV 
GD S SGTAAAHKVLPRGI^P ARQLLLP ASESPHWSGAP VKPSPQ AAA VE VEEEDGSE 
SEESAGPLLKGKPRALGGAAAGGGAAAVPPGAAAGGVALVPKEDSRFSAPRVALV 

20 EQDAPMAPGRSPLATTVMDFIHVPILPLNHALLAARTRQLLEDESYDGGAGAASAF 
APPRSSPCASSTPVAVGDFPDCAYPPDAEPKDDAYPLYSDFQPPALKJKEEEEGAEA 
SARSPRSYLVAGANPAAFPDFPLGPPPPLPPRATPSRPGEAAVTAAPASASVSSASSS 
GSTLECILYKAEGAPPQQGPFAPPPCKAPGASGCLLPRDGLPSTSASAAAAGAAPAL 
YPALGLNGLPQLGYQAAVLKEGLPQVYPPYLNYLRPDSEASQSPQYSFESLPQKICLI 

25 CGDEAS GCHYGYLTCGS CKWFKRAMEGQHNYLC AGRNDCIVDKIRRKNCP ACRL 
PJCCCQAGMVLGGR1CFKKFNKVRVVRAII)AVAIPQPVGWNESQA 
DIQLIPPIJNLLMSIEPDVr^AGHDNTKPDTSSSLLTSLNQLGERQLLSVVKWSKSLPG 
FRNlliroDQITLIQYSWMSLMWGLGWRSYKHVSGQMLYFAPDLrLNEQRMKESSF 
YSLCLTMWQ1PQEFVKLQVSQEEFLCMKVLXLLNT1PLEGLRSQTQFEEMRSSY1REL 

30 IKAIGOIQKGVVSSSQRFYQLTKIXDNIJHDLVKQLHLYCLNTFIQSRALSVEFPEMM 
SEVIAAQLPKILAGMVKPLLFHKK 



62. SEQ ID NO: 62 PR nucleic acid sequence 

35 

1 ctgaccagcg ccgccctccc ccgcccccga cccaggaggt ggagatccct ccggtccagc 
61 cacattcaac acccactttc tcctccctct gcccctatat tcccgaaacc ccctcctcct 
121 tcccttttcc ctcctccctg gagacggggg aggagaaaag gggagtccag tcgtcatgac 
181 tgagctgaag gcaaagggtc cccgggctcc ccacgtggcg ggcggcccgc cctcccccga 

40 241 ggtcggatcc ccactgctgt gtcgcccagc cgcaggtccg ttcccgggga gccagacctc 

301 ggacaccttg cctgaagttt cggccatacc tatctccctg gacgggctac tcttccctcg 
361 gccctgccag ggacaggacc cctccgacga aaagacgcag gaccagcagt cgctgtcgga 
421 cgtggagggc gcatattcca gagctgaagc tacaaggggt gctggaggca gcagttctag 
481 tcccccagaa aaggacagcg gactgctgga cagtgtcttg gacactctgt tggcgccctc 

45 541 aggtcccggg cagagccaac ccagccctcc cgcctgcgag gtcaccagct cttggtgcct 

601 gtttggcccc gaacttcccg aagatccacc ggctgccccc gccacccagc gggtgttgtc 
661 cccgctcatg agccggtccg ggtgcaaggt tggagacagc tccgggacgg cagctgccca 
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721 taaagtgctg ccccggggcc tgtcaccagc ccggcagctg ctgctcccgg cctctgagag 
781 ccctcactgg tccggggccc cagtgaagcc gtctccgcag gccgctgcgg tggaggttga 
841 ggaggaggat ggctctgagt ccgaggagtc tgcgggtccg cttctgaagg gcaaacctcg 
901 ggctctgggt ggcgcggcgg ctggaggagg agccgcggct gtcccgccgg gggcggcagc 
5 961 aggaggcgtc gccctggtcc ccaaggaaga ttcccgcttc tcagcgccca gggtcgccct 

1021 ggtggagcag gacgcgccga tggcgcccgg gcgctccccg ctggccacca cggtgatgga 
1081 tttcatccac gtgcctatcc tgcctctcaa tcacgcctta ttggcagccc gcactcggca 
1141 gctgctggaa gacgaaagtt acgacggcgg ggccggggct gccagcgcct ttgccccgcc 
1201 gcggagttca ccctgtgcct cgtccacccc ggtcgctgta ggcgacttcc ccgactgcgc 

10 1261 gtacccgccc gacgccgagc ccaaggacga cgcgtaccct ctctatagcg acttccagcc 

1321 gcccgctcta aagataaagg aggaggagga aggcgcggag gcctccgcgc gctccccgcg 
1381 ttcctacctt gtggccggtg ccaaccccgc agccttcccg gatttcccgt tggggccacc 
1441 gcccccgctg ccgccgcgag cgaccccatc cagacccggg gaagcggcgg tgacggccgc 
1501 acccgccagt gcctcagtct cgtctgcgtc ctcctcgggg tcgaccctgg agtgcatcct 

15 1561 gtacaaagcg gagggcgcgc cgccccagca gggcccgttc gcgccgccgc cctgcaaggc 

1621 gccgggcgcg agcggctgcc tgctcccgcg ggacggcctg ccctccacct ccgcctctgc 
1681 cgccgccgcc ggggcggccc ccgcgctcta ccctgcactc ggcctcaacg ggctcccgca 
1741 gctcggctac caggccgccg tgctcaagga gggcctgccg caggtctacc cgccctatct 
1801 caactacctg aggccggatt cagaagccag ccagagccca caatacagct tcgagtcatt 

20 1 861 acctcagaag atttgtttaa tctgtgggga tgaagcatca ggctgtcatt atggtgtcct 
1921 tacctgtggg agctgtaagg tcttctttaa gagggcaatg gaagggcagc acaactactt 
1981 atgtgctgga agaaatgact gcatcgttga taaaatccgc agaaaaaact gcccagcatg 
2041 tcgccttaga aagtgctgtc aggctggcat ggtccttgga ggtcgaaaat ttaaaaagtt 
2101 caataaagtc agagttgtga gagcactgga tgctgttgct ctcccacagc cagtgggcgt 

25 2161 tccaaatgaa agccaagccc taagccagag attcactttt tcaccaggtc aagacataca 
2221 gttgattcca ccactgatca acctgttaat gagcattgaa ccagatgtga.tctatgcagg 
2281 acatgacaac acaaaacctg acacctccag ttctttgctg acaagtctta atcaactagg 
2341 cgagaggcaa cttctttcag tagtcaagtg gtctaaatca ttgccaggtt ttcgaaactt 
2401 acatattgat gaccagataa ctctcattca gtattcttgg atgagcttaa tggtgtttgg 

30 2461 tctaggatgg agatcctaca aacacgtcag tgggcagatg ctgtattttg cacctgatct 
2521 aatactaaat gaacagcgga tgaaagaatc atcattctat tcattatgcc ttaccatgtg 
2581 gcagatccca caggagtttg tcaagcttca agttagccaa gaagagttcc tctgtatgaa 
2641 agtattgtta cttcttaata caattccttt ggaagggcta cgaagtcaaa cccagtttga 
2701 ggagatgagg tcaagctaca ttagagagct catcaaggca attggtttga ggcaaaaagg 

35 2761 agttgtgtcg agctcacagc gtttctatca acttacaaaa cttcttgata acttgcatga 
2821 tcttgtcaaa caacttcatc tgtactgctt gaatacattt atccagtccc gggcactgag 
2881 tgttgaattt ccagaaatga tgtctgaagt tattgctgca caattaccca agatattggc 
2941 agggatggtg aaaccccttc tctttcataa aaagtgaatg tcatcttttt cttttaaaga 
3001 attaaatttt gtgg 

40 

63. SEQ ID NO: 63 PR DBD nucleic acid sequence 

. cctcagaag atttgtttaa tctgtgggga tgaagcatca ggctgtcatt atggtgtcct 
1921 tacctgtggg agctgtaagg tcttctttaa gagggcaatg gaagggcagc acaactactt 
45 1981 atgtgctgga agaaatgact gcatcgttga taaaatccgc agaaaaaact gcccagcatg 

2041 tcgccttaga aagtgctgtc aggctggcat ggtcctt 

64. SEQ ID NO: 64 PR hinge domain nucleic acid sequence 
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gga ggtcgaaaat ttaaaaagtt 
2101 caataaagtc agagttgtga gagcactgga tgctgttgct ctcccacagc cagtgggcgt 
2161 tccaaatgaa agccaagccc taagccagag attcactttt tcaccaggtc aagacataca 
2221 gttgattcca ccactga 

5 

65. SEQ ID NO: 65 Thyroid hormone Receptor (TR) amino acid sequence 

MTENGLTAWDKPKHCPDREHD 
10 TTWTSSIFHLDHDDVNDQSVSSAQTFQTEEKKCK^ 

YHYRCITCEGCKGFFRRTIQKNLHPSYSCKYEGKCVIDKVTRNQCQECRF 
MATDLVLDDSKRLAKRK^ 

ATNAQGSHWKQKPKFLPEDIGQAPIVNAPEGGKVDLEAFSHFT 
KKLPMFCELPCEDQDLLKGCCMEIMSLRAAVRYDPESETLTLNG 
15 GLGWSDAIFDLGMSI^SFNLDDTEVALLQAVLLMSSDRPGLACVERIEKY 
AFEHYINYRKHHVTHFWPKLLMKVTDLRMIGAC 
FED 

66. SEQ ID NO: 66 TR nucleic acid sequence 

20 

1 cggcggggat caactttgca tgaataatgt gagtgcgctt ggaaaagaga cctcctgctc 
61 cgcgggctcg gggcaagagc ccgcaggcta ccttccccgg gcaggggcgc tcaacccaac 
121 cggctccagg gcactggtaa tttggctaga ggaccgcgcg gaggcagcgg gatctgcgat 
181 ttccttctgg ttggctgtcc tgcgtgggtg ccaagttcca cacatgattt aatgaataag 
25 241 aaggagatgt cagtgaaaaa agggatccag aatgattact aacctataac ccccaacagt 

301 atgacagaaa atggccttac agcttgggac aaaccgaagc actgtccaga ccgagaacac 
361 gactggaagc tagtaggaat gtctgaagcc tgcctacata ggaagagcca ttcagagagg 
421 cgcagcacgt tgaaaaatga acagtcgtcg ccacatctca tccagaccac ttggactagc 
481 tcaatattcc atctggacca tgatgatgtg aacgaccaga gtgtctcaag tgcccagacc 
30 541 ttccaaacgg aggagaagaa atgtaaaggg tacatcccca gttacttaga caaggacgag 

601 ctctgtgtag tgtgtggtga caaagccacc gggtatcact accgctgtat cacgtgtgaa 
661 ggctgcaagg gtttctttag aagaaccatt cagaaaaatc tccatccatc ctattcctgt 
721 aaatatgaag gaaaatgtgt catagacaaa gtcacgcgaa atcagtgcca ggaatgtcgc 
781 tttaagaaat gcatctatgt tggcatggca acagatttgg tgctggatga cagcaagagg 
35 841 ctggccaaga ggaagctgat agaggagaac cgggagaaaa gacggcggga agagctgcag 

901 aagtccatcg ggcacaagcc agagcccaca gacgaggaat gggagctcat caaaactgtc 
961 accgaagccc atgtggcgac caacgcccaa ggcagccact ggaagcaaaa accgaaattt 
1021 ctgccagaag acattggaca agcaccaata gtcaatgccc cagaaggtgg aaaggttgac 
1081 ttggaagcct tcagccattt tacaaaaatc atcacaccag caattaccag agtggtggat 
40 1 141 tttgccaaaa agttgcctat gttttgtgag ctgccatgtg aagaccagat catcctcctc 

1201 aaaggctgct gcatggagat catgtccctt cgcgctgctg tgcgctatga cccggaaagt 
1261 gagactttaa ccttgaatgg ggaaatggca gtgatacggg gccagctgaa aaatgggggt 
1321 cttggggtgg tgtcagacgc catctttgac ctaggcatgt ctctgtcttc tttcaacctg 
1381 gatgacactg aagtagccct ccttcaggcc gtcctgctga tgtcttcaga tcgcccgggg 
45 1441 cttgcctgtg ttgagagaat agaaaagtac caagatagtt tcctgctggc ctttgaacac 

1501 tatatcaatt accgaaaaca ccacgtgaca cacttttggc caaaactcct gatgaaggtg 
1561 acagatctgc ggatgatagg agcctgccat gccagccgct tcctgcacat gaaggtggaa 
1621 tgccccacag aactcctccc ccctttgttc ctggaagtgt tcgaggatta gactgactgg 
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1681 attccttcct ataattcc 



67. SEQ ID NO: 67 TR DBD nucleic acid sequence 

5 aaggacgag 
601 ctctgtgtag tgtgtggtga caaagccacc gggtatcact accgctgtat cacgtgtgaa 
661 ggctgcaagg gtttctttag aagaaccatt cagaaaaatc tccatccatc ctattcctgt 
721 aaatatgaag gaaaatgtgt catagacaaa gtcacgcgaa atcagtgcca ggaatgtcgc 
781 tttaagaaat gcatctatgt tggcatg 

10 

68. SEQ ID NO: 68 TR hinge domain nucleic acid sequence 

gca acagatttgg tgctggatga cagcaagagg 
841 ctggccaaga ggaagctgat agaggagaac cgggagaaaa gacggcggga agagctgcag 
15 901 aagtccatcg ggcacaagcc agagcccaca gacgaggaat gggagctcat caaaactgtc 

961 accgaagccc atgtggcgac caacgcccaa ggcagc 



20 

69. SEQ ID NO: 69 Retinoic X Receptor a (RXRa) amino acid sequence 

MDTKHFLPIJDFSTQVNSSLTSPTG 

GMGPPFSVISSPMGPHSMSVPTTPTLGFSTGSPQI^SPM^VSSSEDIKPPLGLNGVL 
25 KWAHPSGNMASFTKfflC^ 
NKDCLIDKRQRNRCQYCR 

EDMPVERn^AELAXTEPKTETYVEANMGLNPSSPNDPVT^ 
KRIPHFSELPLDDQVILLRAGWNELLIASFSHRSIAVXD 
VGAIFDRVLTELVSKMRDMQMDK^ 
30 ASLEAYCKHKYPEQPGRFAKLLLRLPA^ 
EAPHQMT 

70. SEQ ID NO: 70 RXRa amino acid sequence 

35 1 gcgccggggg ccgccgcgcc cgccgcccgc tgcctgcgcc gccggccggg catgagttag 

61 tcgcagacat ggacaccaaa catttcctgc cgctcgattt ctccacccag gtgaactcct 
T21 ccctcacctc cccgacgggg cgaggctcca tggctgcccc ctcgctgcac ccgtccctgg 
181 ggcctggcat cggctccccg ggacagctgc attctcccat cagcaccctg agctccccca 
241 tcaacggcat gggcccgcct ttctcggtca tcagctcccc catgggcccc cactccatgt 

40 301 cggtgcccac cacacccacc ctgggcttca gcactggcag cccccagctc agctcaccta 

361 tgaaccccgt cagcagcagc gaggacatca agccccccct gggcctcaat ggcgtcctca 
421 aggtccccgc ccacccctca ggaaacatgg cttccttcac caagcacatc tgcgccatct 
481 gcggggaccg ctcctcaggc aagcactatg gagtgtacag ctgcgagggg tgcaagggct 
541-tcttcaagcg gacggtgcgc aaggacctga cctacacctg ccgcgacaac aaggactgcc 

45 601 tgattgacaa gcggcagcgg aaccggtgcc agtactgccg ctaccagaag tgcctggcca 

661 tgggcatgaa gcgggaagcc gtgcaggagg agcggcagcg tggcaaggac cggaacgaga 
721 atgaggtgga gtcgaccagc agcgccaacg aggacatgcc ggtggagagg atcctggagg 
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781 ctgagctggc cgtggagccc aagaccgaga cctacgtgga ggcaaacatg gggctgaacc 
841 ccagctcgcc gaacgaccct gtcaccaaca tttgccaagc agccgacaaa cagcttttca 
901 ccctggtgga gtgggccaag cggatcccac acttctcaga gctgcccctg gacgaccagg 
961 tcatcctgct gcgggcaggc tggaatgagc tgctcatcgc ctccttctcc caccgctcca 
5 1 02 1 tcgccgtgaa ggacgggatc ctcctggcca ccgggctgca cgtccaccgg aacagcgccc 
1081 acagcgcagg ggtgggcgcc atctttgaca gggtgctgac ggagcttgtg tccaagatgc 
1141 gggacatgca gatggacaag acggagctgg gctgcctgcg cgccatcgtc ctctttaacc 
1201 ctgactccaa ggggctctcg aacccggccg aggtggaggc gctgagggag aaggtctatg 
1261 cgtccttgga ggcctactgc aagcacaagt acccagagca gccgggaagg ttcgctaagc 

10 1321 tcttgctccg cctgccggct ctgcgctcca tcgggctcaa atgcctggaa catctcttct 

1381 tcttcaagct catcggggac acacccattg acaccttcct tatggagatg ctggaggcgc 
1441 cgcaccaaat gacttaggcc tgcgggccca tcctttgtgc ccacccgttc tggccaccct 
1501 gcctggacgc cagctgttct tctcagcctg agccctgtcc ctgcccttct ctgcctggcc 
1561 tgtttggact ttggggcaca gcctgtcact gctctgccta agagatgtgt tgtcaccctc 

15 1 62 1 cttatttctg ttactacttg tctgtggccc agggcagtgg ctttcctgag gcagcagcct 

1 681 tcgtggcaag aactagcgtg agcccagcca ggcgcctccc caccgggctc tcaggacacc 
1741 ctgccacacc ccacggggct tgggcgacta cagggtcttc gggccccagc cctggagctg 
1 801 caggagttgg gaacggggct tttgtttccg ttgctgttta tcgatgctgg ttttcagaat 
1861 tcctgtgtgg ccctcctgtc tggagtgaca tcttcatctg ctctgaatac tggtgcccag 

20 1921 ccagcccgtg acagcttccc cctaatcagg aggggacagc tgggggcgca agctggtgtg 
1981 tcatcagcaa agacctcagc cgcctcgggg atgagagggg actcgtgggg caagcaagct 
2041 gccctgtgct ctgagtgagg gggaaggtag cccctttttc caaagataac tcacagtttt 
2101 gccctcgagc caatgagaac atgagctgcc ctctgtgcaa ggtttcgggg ccacctccag 
2161 gctgcagggg cgggtcactc acccccctgt tttctctctg ccttggtgtt ctggtttcag 

25 2221 actcccgact ccccgttcag accagagtgc cccggcccct ccccagcctg agtcttctcc 
2281 ttgctctgcg gggtgggctg aggcttgtcc ttgtttcctg cagggctggc cctggctcgg 
2341 gcagggtggg gcatcaccac ctcactggcc ttgctggagg cacagggctc tgcggacctg 
2401 cagccatctg tgaggcccgc ggggatggga ggggaggagg gtggcctgtt ggtttccctc 
2461 agagggggca ggtggcctgg agagagaggg gctcaggaac tgggagcctc gtgggtgggg 

30 2521 cagatgctcc gcggcctgga gtggctctgc cggggcattg gtgggacccc tgctcaggcc 
2581 ttctctctgg ctgccagttg tgtctaaaag actcttggaa tctgagaacc cggagtcgca 
2641 gcgccctcgg gcctgggcca cacgcaggcc ctggtgggac cacccagcct ggtattgtcc 
2701 acggacagcg ttgttcaccc agagccttac ttgggagcct cactgaacgc ctgctctggt 
2761 tgaaggtggg gtgggggcgg ggcttggggc ctccctggct cagcccagtg cggcctggcg 

35 2821 ctcctcccgc aggctctgcc cccgggctcc ggtggtgcgg ggccctctca ggttgaactc 
2881 gcctcttttg cactggaagg ccctcccttt ggcctgagta cttttcccgt tcacgcctca 
2941 gtcccgtgga cccagccttt gtcagtggca ggtgcctgaa cagagggtgg atggggggga 
3001 taccggaggg ggtcttgtct tcccagccgc agtctaggaa tgatgcgggg gggtggacgc 
3061 cttctccata gtctttcccc acctggagca ggggcttcct cagtggtgag gggagctgcc 

40 3121 tacaggttgg accgggaggc agtggcttgg agaggcagct ttccagcctt ggtggggaag 
3181 aaagtgtcca ttctttgcct tcctggagct cccagccaga gctgagctta ggcacccgag 
3241 tggagcctgc agctgagtct gtgcccgaga caggctgtca gagattccag aagcctctcc 
3301 tccccgccgc cctccacccc tgcctttcag cgttgtggat ccctagaggt ggccccctgc 
3361 ccgatccacc gtcctgaggc agagtgttga gcctcatacc tgtaccaggt ccccggccag 

45 3421 ctgggcccct cccaggcact gccaggaagc cccagctgcc cctggcgggt gtggtggaaa 
3481 tggcaggagg gtgcaggtac tcttggggcc ccagcggtgg gagtgcaaaa gacccaacgc 
3541 caacacctgg tgccttttgc agccagcgcc cacccatccg tgcccggacc cttgggaatg 
3601 cccgcggctc cagaggaaaa agcccaggga cggggcctcc gttgcggggg gtcggctgct 
3661 tcttgggaac tttgtcgttt ccggcgctgg ctggctggct ggctgtaaag cactgaagcc 
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3721 ccccggccgc caacccctga aagcagaacc tggcctccct ggccacagca gccttaccca 
3781 ccgctctacg tgtcccgggc acttcccgca gccttcccgt ccctttctca tcggccttgt 
3841 agttgtacag tgctgttggt ttgaaaaggt gatgtgtggg gagtgcggct catcactgag 
3901 tagagaggta gaatttctat ttaaccagac ctgtagtagt attaccaatc cagttcaatt 
5 3961 aaggtgattt tttgtaatta ttattatttt ggtgggacaa tctttaattt tctaaagata 

4021 gcactaacat cagctcatta gccacctgtg cctgtccccg ccttggcccg gctggatgaa 
4081 gcggcttccc cgcagggccc ccacttccca gtggctgctt cctggggacc cagggcaccc 
4141 cggcaccttc aggcacgctc ctcagctggt cacctcccgg ctttgccgtt cagatggggc 
4201 tcctgaggct caggagtgaa gatgccacag agccgggctc ccctaggctg cgtcgggcat 

10 4261 gcttggaagc tggcctgcca ggaccttcca ccctggggcc tgtgtcagcc gccggccctc 
4321 cgcaccctgg aagcacacgg cctctgggaa ggacagccct gaccttcggt tttccgagca 
4381 cggtgtttcc caagaattct gggctggcgg cctggtggca gtgctggaga tgaccccgag 
4441 cccctccccg tggggcaccc aggagggccc tgccggaatg tgcagcctgt gggtagtcgg 
4501 ctggtgtccc tgtcgtggag ctggggtgcg tgatctggtg ctcgtccacg caggtgtgtg 

15 456 1 gtgtaaacat gtatgtgctg tacagagaga cgcgtgtgga gagagccgca caccagcgcc 
4621 acccaggaaa ggcggagcgg ttaccagtgt tttgtgttta tttttaatca agacgtttcc 
4681 cctgttttcc tataaatttg cttcgtgtaa gcaagtacat aaggaccctc ctttggtgaa 
4741 atccgggttc gaatgaatat ctcaaggcag gagatgcatc tattttaaga tgctttggag 
4801 cagacagctt tagccgttcc caatccttag caatgcctta gctgggacgc atagctaata 

20 4861 ctttagagag gatgacagat ccataaagag agtaaagata agagaaaatg tctaaagcat 
4921 ctggaaaggt aaaaaaaaaa aatctatttt tgtacaaatg taattttatc cctcatgtat 
4981 acttggatat ggcgggggga gggctgggac tgtttcgttt ctgcttctag agattgaggt 
5041 gaaagcttcg tccgagaaac gccaggacag acgatggcag aggagagggc tcctgtgacg 
5101 gcggcgaggc ttgggaggaa accgccgcaa tgggggtgtc ttccctcggg gcaggagggt 

25 5161 gggcctgagg ctttcaaggg ttttcttccc tttcgagtaa tttttaaagc cttgctctgt 
5221 tgtgtcctgt tgccggctct ggccttcctg tgactgactg tgaagtggct tctccgtacg 
5281 attgtctctg aaacatcgtg gcctcaggtg ccagggtttg atggacagta gcattagaat 
5341 tgtggaaaag gaacacgcaa agggagaagt gtgagaggag aaacaaaata tgagcgttta 
5401 aaatacatcg ccattcagtt cgttaaaaaa aaaaaaaaaa aaaaaaaaa 

30 

71. SEQ ID NO: 71 RXRa DBD nucleic acid sequence 

ac caagcacatc tgcgccatct 

481 gcggggaccg ctcctcaggc aagcactatg gagtgtacag ctgcgagggg tgcaagggct 
541 tcttcaagcg gacggtgcgc aaggacctga cctacacctg ccgcgacaac aaggactgcc 
35 601 tgattgacaa gcggcagcgg aaccggtgcc agtactgccg ctaccagaag tgcctggcca 

661 tgggcatg 

72. SEQ ID NO: 72 RXRa hinge domain nucleic acid sequence 

aa gcgggaagcc gtgcaggagg agcggcagcg tggcaaggac cggaacgaga 
40 721 atgaggtgga gtcgaccagc agcgccaacg aggacatgcc ggtggagagg atcctggagg 

781 ctgagctggc cgtggagccc aagaccgaga cctacgtgga ggcaaacatg gggctgaacc 
841 ccagc 



45 73. SEQ ID NO: 73 Retinoic Acid Receptor a (RARa) amino acid sequence 



MASNSSSCPTPGGGHLNGYPWPYAFFFPPMLGGLSPPGALTTLQHQLPVSGYST^ 
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PATffiTQSSSSEEIWSPPSPPPLPRIYKPCFVCQDKSSGYHYGVSACEGCKGFFRRSIQ 

KNMVYTCHRDKNCIINKVTRNRCQYCRLQKCFEVGMSKES 

PECSESYTLTPEVGELffiKVRKAHQETFPALCQLGKYTTNNSSEQRVSLDro 

SELSTKCIIKTVEFAKQLPGFTTLTL^ 
5 GLTimTQMHNAGFGPLTDLWAFANQLIJLEMDDAETGLI^AICUCGDRQDLEQ 
DRVDMLQEPLLEALKVYVRKRRPSRPHMFPKMLNm 

IPGSMPPLIQEMLENSEGLDTLSGQPGGGGRDGGGLAPPPGSCSPSLSPSSNRSSPAT 
HSP 

10 74. SEQ ID NO: 74 RARa nucleic acid sequence 

1 gccatctggg cccaggcccc atgccccgag gaggggtggt ctgaagccca ccagagcccc 
61 ctgccagact gtctgcctcc cttctgactg tggccgcttg gcatggccag caacagcagc 
121 tcctgcccga cacctggggg cgggcacctc aatgggtacc cggtgcctcc ctacgccttc 

15 181 ttcttccccc ctatgctggg tggactctcc ccgccaggcg ctctgaccac tctccagcac 

241 cagcttccag ttagtggata tagcacacca tccccagcca ccattgagac ccagagcagc 
301 agttctgaag agatagtgcc cagccctccc tcgccacccc ctctaccccg catctacaag 
361 ccttgctttg tctgtcagga caagtcctca ggctaccact atggggtcag cgcctgtgag 
421 ggctgcaagg gcttcttccg ccgcagcatc cagaagaaca tggtgtacac gtgtcaccgg 

20 481 gacaagaact gcatcatcaa caaggtgacc cggaaccgct gccagtactg ccgactgcag 

541 aagtgctttg aagtgggcat gtccaaggag tctgtgagaa acgaccgaaa caagaagaag 
601 aaggaggtgc ccaagcccga gtgctctgag agctacacgc tgacgccgga ggtgggggag 
661 ctcattgaga aggtgcgcaa agcgcaccag gaaaccttcc ctgccctctg ccagctgggc 
721 aaatacacta cgaacaacag ctcagaacaa cgtgtctctc tggacattga cctctgggac 

25 781 aagttcagtg aactctccac caagtgcatc attaagactg tggagttcgc caagcagctg 

841 cccggcttca ccaccctcac catcgccgac cagatcaccc tcctcaaggc tgcctgcctg 
901 gacatcctga tcctgcggat ctgcacgcgg tacacgcccg agcaggacac catgaccttc 
961 tcggacgggc tgaccctgaa ccggacccag atgcacaacg ctggcttcgg ccccctcacc 
1021 gacctggtct ttgccttcgc caaccagctg ctgcccctgg agatggatga tgcggagacg 

30 1 081 gggctgctca gcgccatctg cctcatctgc ggagaccgcc aggacctgga gcagccggac 
1141 cgggtggaca tgctgcagga gccgctgctg gaggcgctaa aggtctacgt gcggaagcgg 
1201 aggcccagcc gcccccacat gttccccaag atgctaatga agattactga cctgcgaagc 
1261 atcagcgcca agggggctga gcgggtgatc acgctgaaga tggagatccc gggctccatg 
1321 ccgcctctca tccaggaaat gttggagaac tcagagggcc tggacactct gagcggacag 

35 1381 ccggggggtg gggggcggga cgggggtggc ctggcccccc cgccaggcag ctgtagcccc 

1441 agcctcagcc ccagctccaa cagaagcagc ccggccaccc actccccgtg accgcccacg 
1501 ccacatggac acagccctcg ccctccgccc cggcttttct ctgcctttct accgaccatg 
1561 tgaccccgca ccagccctgc ccccacctgc cctcccgggc agtactgggg accttccctg 
1621 ggggacgggg agggaggagg cagcgactcc ttggacagag gcctgggccc tcagtggact 

40 1681 gcctgctccc acagcctggg ctgacgtcag aggccgaggc caggaactga gtgaggcccc 
1741 tggtcctggg tctcaggatg ggtcctgggg gcctcgtgtt catcaagaca cccctctgcc 
1801 cagctcacca catcttcatc accagcaaac gccaggactt ggctccccca tcctcagaac 
1861 tcacaagcca ttgctcccca gctggggaac ctcaacctcc cccctgcctc ggttggtgac 
1921 agagggggtg ggacaggggc ggggggttcc ccctgtacat accctgccat accaacccca 

45 1981 ggtattaatt ctcgctggtt ttgtttttat tttaattttt ttgttttgat ttttttaata 

2041 agaattttca ttttaagcac atttatactg aaggaatttg tgctgtgtat tggggggagc 

2101 tggatccaga gctggagggg gtgggtccgg gggagggagt ggctcggaag gggcccccac 

2161 tctcctttca tgtccctgtg ccccccagtt ctcctcctca gccttttcct cctcagtttt 
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2221 ctctttaaaa ctgtgaagta ctaactttcc aaggcctgcc ttcccctccc tcccactgga 
2281 gaagccgcca gcccctttct ccctctgcct gaccactggg tgtggacggt gtggggcagc 
2341 cctgaaagga caggctcctg gccttggcac ttgcctgcac ccaccatgag gcatggagca 
2401 gggcagagca agggccccgg gacagagttt tcccagacct ggctcctcgg cagagctgcc 
5 2461 tcccgtcagg gcccacatca tctaggctcc ccagccccca ctgtgaaggg gctggccagg 
2521 ggcccgagct gcccccaccc ccggcctcag ccaccagcac ccccataggg cccccagaca 
2581 ccacacacat gcgcgtgcgc acacacacaa acacacacac actggacagt agatgggccg 
2641 acacacactt ggcccgagtt cctccatttc cctggcctgc cccccacccc caacctgtcc 
2701 cacccccgtg ccccctcctt accccgcagg acgggcctac aggggggtct cccctcaccc 
10 2761 ctgcaccccc agctggggga gctggctctg ccccgacctc cttcaccagg ggttggggcc 
2821 ccttcccctg gagcccgtgg gtgcacctgt tactgttggg ctttccactg agatctactg 
2881 gataaagaat aaagttctat ttattct 

75. SEQ ID NO: 75 RARct DBD nucleic acid sequence 

15 atctacaag 
361 ccttgctttg tctgtcagga caagtcctca ggctaccact atggggtcag cgcctgtgag 
421 ggctgcaagg gcttcttccg ccgcagcatc cagaagaaca tggtgtacac gtgtcaccgg 
481 gacaagaact gcatcatcaa caaggtgacc cggaaccgct gccagtactg ccgactgcag 
541 aagtgctttg aagtgggcat gtccaag 



20 



40 



76. SEQ ID NO: 76 RARct hinge domain nucleic acid sequence 



gag tctgtgagaa acgaccgaaa caagaagaag 
601 aaggaggtgc ccaagcccga gtgctctgag agctacacgc tgacgccgga ggtgggggag 
661 ctcattgaga aggtgcgcaa agcgcaccag gaaaccttcc ctgccctctg ccagctgggc 
25 721 aaatacacta cgaacaacag ctcagaacaa cgtgtctctc tggacattga cctc 

77. SEQ ID NO: 77 Vitamin D Receptor (VDR) amino acid sequence 

30 MEAMAASTSLPDPGDFDRNWWC 

ALFTCPFNGDCRITKDNRRHCQACRLKRCVDIGMMKEFI^ 
EEEALKDSLRPKLSEEQQRIIAILLDAHHKTYDPTYSDFCQFRPPW 
PNSI^TPSFSGDSSSSCSDHCITSSDMMDSSSFSNLDLSEEDSDDPSVTLELSQl^MLP 
HLADLVSYSIQKVIGFAKMIPGFRDLTSE 

35 TCGNQDYKYRVSDVTKAGHSLELffiPLnCFQVGLKKLNLHEEEHVLLM 

PGVQDAALffiAIQDRLSNTLQTYmCRHPPPGSHLLYAKMIQKLADLRSL^ 
YRCLSFQPECSMKLTPLVLEVFGNEIS 



78. SEQ ID NO: 78 VDR nucleic acid sequence 



1 atggaggcaa tggcggccag cacttccctg cctgaccctg gagactttga ccggaacgtg 
61 ccccggatct gtggggtgtg tggagaccga gccactggct ttcacttcaa tgctatgacc 
121 tgtgaaggct gcaaaggctt cttcaggcga agcatgaagc ggaaggcact attcacctgc 
181 cccttcaacg gggactgccg catcaccaag gacaaccgac gccactgcca ggcctgccgg 
45 241 ctcaaacgct gtgtggacat cggcatgatg aaggagttca ttctgacaga tgaggaagtg 

301 cagaggaagc gggagatgat cctgaagcgg aaggaggagg aggccttgaa ggacagtctg 
361 cggcccaagc tgtctgagga gcagcagcgc atcattgcca tactgctgga cgcccaccat 
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421 aagacctacg accccaccta ctccgacttc tgccagttcc ggcctccagt tcgtgtgaat 
481 gatggtggag ggagccatcc ttccaggccc aactccagac acactcccag cttctctggg 
541 gactcctcct cctcctgctc agatcactgt atcacctctt cagacatgat ggactcgtcc 
601 agcttctcca atctggatct gagtgaagaa gattcagatg acccttctgt gaccctagag 

5 661 ctgtcccagc tctccatgct gccccacctg gctgacctgg tcagttacag catccaaaag 

721 gtcattggct ttgctaagat gataccagga ttcagagacc tcacctctga ggaccagatc 
781 gtactgctga agtcaagtgc cattgaggtc atcatgttgc gctccaatga gtccttcacc 
841 atggacgaca tgtcctggac ctgtggcaac caagactaca agtaccgcgt cagtgacgtg 
901 accaaagccg gacacagcct ggagctgatt gagcccctca tcaagttcca ggtgggactg 

10 961 aagaagctga acttgcatga ggaggagcat gtcctgctca tggccatctg catcgtctcc 

1021 ccagatcgtc ctggggtgca ggacgccgcg ctgattgagg ccatccagga ccgcctgtcc 
1081 aacacactgc agacgtacat ccgctgccgc cacccgcccc cgggcagcca cctgctctat 
1 141 gccaagatga tccagaagct agccgacctg cgcagcctca atgaggagca ctccaagcag 
1201 taccgctgcc tctccttcca gcctgagtgc agcatgaagc taacgcccct tgtgctcgaa 

15 1261 gtgtttggca atgagatctc ctga 

79. SEQ ID NO: 79 VDR DBD nucleic acid sequence 

cggatct gtggggtgtg tggagaccga gccactggct ttcacttcaa tgctatgacc 
20 121 tgtgaaggct gcaaaggctt cttcaggcga agcatgaagc ggaaggcact attcacctgc 

181 cccttcaacg gggactgccg catcaccaag gacaaccgac gccactgcca ggcctgccgg 
241 ctcaaacgct gtgtggacat cggcatgatg aag 



80. SEQ ID NO: 80 VDR hinge domain nucleic acid sequence 

25 gagttca ttctgacaga tgaggaagtg 

301 cagaggaagc gggagatgat cctgaagcgg aaggaggagg aggccttgaa ggacagtctg 
361 cggcccaagc tgtctgagga gcagcagcgc atcattgcca tactgctgga cgcccaccat 
421 aagacctacg accccaccta ctccgacttc tgccagttcc ggcctcca 

30 81. SEQ ID NO: 81 ERa Hinge Domain Amino Acid Sequence 

MKGGVRKDRRGGRMLKHKRQRDDGEGRGEVGSAGDMRAANLWPSPLMIKRS 

82. SEQ ID NO: 82 c-Myb DNA binding Domain Amino Acid Sequence 

MARRPRHSIYSSDEDDEDFEMCDHDYDGLLPKSGKRHLGKTRWTREE 

35 

83. SEQ ID NO: 83 Hinge Domain Nucleic Acid Sequence 

CTCTGGATCCGGTGGAGGTGGTTCTGGAGGAGGTGGTTCCGGAGGTGGAGGAA 
AGGAGACGCGTTACGCTG 

40 

84. SEQ ID NO: 84 Hinge Domain Amino Acid Sequence 

LWIRWRWFWRRWFRRWRKGDALTL 
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